TO ALL WHOM IT MAY CONCERN:

Be it known that We, S. Gary Mansfield, Lloyd G. Mitchell, Mariano A. 
Garcia-Blanco, Christopher E. Walsh, citizens of the United States, and Hengjun Chao, a 
citizen of the People's Republic of China, whose post office addresses are 1005 Prologue
Road, Durham, North Carolina; 4500 Highgate Drive, Durham, North Carolina 27713;
12 Sanderling Court, Durham, North Carolina 27713; 504 Bayberry Drive, Chapel
Hill, North Carolina; and 501 NC Highway 54, Carrboro, North Carolina, respectively,
have invented an improvement in

"Methods and Compositions for Use in Spliceosome
Mediated RNA Trans-splicing"

of which the following is a 

SPECIFICATION 

The present application is a continuation-in-part of pending application 
serial number 09/756,096 filed January 8, 2001, which is a continuation-in-part of pending
application serial number 09/158,863 filed September 23, 1998 which is a continuation- 
in-part of serial number 09/133,717 filed on August 13, 1998 which is a continuation-in- 
part of serial number 09/087,233 filed on May 28, 1998, which is a continuation-in-part 
of pending application serial number 08/766,354 filed on December 13, 1996, which 
claims benefit to provisional application number 60/008,317 filed on December 15, 1995. 

The present invention was made with government support under Grant 
Nos. SBIR R43DK56526-01 and SBIR R44DK56526-02. The government has certain 
rights in the invention. 

1. INTRODUCTION 

[0001] The present invention provides methods and compositions for generating novel 
nucleic acid molecules through targeted spliceosomal trans-splicing. The compositions of
the invention include pre-trans-splicing molecules (PTMs) designed to interact with a natural
target precursor messenger RNA molecule (target pre-mRNA) and mediate a trans-splicing
reaction resulting in the generation of a novel chimeric RNA molecule (chimeric RNA). The 
PTMs of the invention are genetically engineered so as to result in the production of a novel 
chimeric RNA which may itself perform a function, such as inhibiting the translation of the 
RNA, or that encodes a protein that complements a defective or inactive protein in a cell, or 
encodes a toxin which kills specific cells. Generally, the target pre-mRNA is chosen as a 
target because it is expressed within a specific cell type thus providing a means for targeting 
expression of the novel chimeric RNA to a selected cell type. The invention further relates 
to PTMs that have been genetically engineered for the identification of exon/intron 
boundaries of pre-mRNA molecules using an exon tagging method. In addition, PTMs can 
be designed to result in the production of chimeric RNA encoding peptide affinity
purification tags which can be used to purify and identify proteins expressed in a specific cell
type. The methods of the invention encompass contacting the PTMs of the invention with
a target pre-mRNA under conditions in which a portion of the PTM is trans-spliced to a 
portion of the target pre-mRNA to form a novel chimeric RNA molecule. The methods and 
compositions of the invention can be used in cellular gene regulation, gene repair and suicide 
gene therapy for treatment of proliferative disorders such as cancer or treatment of genetic, 
autoimmune or infectious diseases. In addition, the methods and compositions of the 
invention can be used to generate novel nucleic acid molecules in plants through targeted 
spliceosomal trans-splicing. For example, targeted trans-splicing may be used to regulate
gene expression in plants for treatment of plant diseases, engineering of disease-resistant
plants or expression of desirable genes in plants. The methods and compositions of the
invention can also be used to map intron-exon boundaries and to identify novel proteins
expressed in any given cell. 

2. BACKGROUND OF THE INVENTION 
[0002] DNA sequences in the chromosome are transcribed into pre-mRNAs which 
contain coding regions (exons) and generally also contain intervening non-coding regions 
(introns). Introns are removed from pre-mRNAs in a precise process called splicing (Chow
et al., 1977, Cell 12:1-8; and Berget, S.M. et al., 1977, Proc. Natl. Acad. Sci. USA 74:3171-3175).
Splicing takes place as a coordinated interaction of several small nuclear
ribonucleoprotein particles (snRNPs) and many protein factors that assemble to form an
enzymatic complex known as the spliceosome (Moore et al., 1993, in The RNA World, R.F.
Gesteland and J.F. Atkins eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
N.Y.); Kramer, 1996, Annu. Rev. Biochem., 65:367-404; Staley and Guthrie, 1998, Cell
92:315-326). 

[0003] Pre-mRNA splicing proceeds by a two-step mechanism. In the first step, the 
5' splice site is cleaved, resulting in a "free" 5' exon and a lariat intermediate (Moore, M.J.
and P.A. Sharp, 1993, Nature 365:364-368). In the second step, the 5' exon is ligated to the
3' exon with release of the intron as the lariat product. These steps are catalyzed in a
complex of small nuclear ribonucleoproteins and proteins called the spliceosome. The
splicing reaction sites are defined by consensus sequences around the 5' and 3' splice sites.
The 5' splice site consensus sequence is AG/GURAGU (where A = adenosine, U = uracil, G
= guanine, C = cytosine, R = purine and / = the splice site). The 3' splice region consists of
three separate sequence elements: the branch point or branch site, a polypyrimidine tract and
the 3' splice consensus sequence (YAG). These elements loosely define a 3' splice region,
which may encompass 100 nucleotides of the intron upstream of the 3' splice site. The
branch point consensus sequence in mammals is YNYURAC (where N = any nucleotide, Y =
pyrimidine). The A in this consensus is the site of branch formation (the BPA = branch point
adenosine). The 3' splice consensus sequence is YAG/G. Between the branch point and the
splice site there is usually found a polypyrimidine tract, which is important in mammalian
systems for efficient branch point utilization and 3' splice site recognition (Roscigno, R.F.
et al., 1993, J. Biol. Chem. 268:11222-11229). The first YAG trinucleotide downstream
from the branch point and polypyrimidine tract is the most commonly used 3' splice site
(Smith, C.W. et al., 1989, Nature 342:243-247).
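
The consensus sequences described in this paragraph can be read as simple degenerate patterns. The following is a minimal illustrative sketch, not part of the disclosed methods: it translates the consensus strings into regular expressions and locates candidate sites in an RNA string. The function names, the offsets, and the toy sequence are hypothetical and chosen only for illustration; the polypyrimidine tract is not modeled, and real splice site selection depends on many factors beyond a consensus match.

```python
import re
from typing import List

# Degenerate codes as defined in the text: R = purine, Y = pyrimidine, N = any nucleotide.
CODES = {"A": "A", "C": "C", "G": "G", "U": "U",
         "R": "[AG]", "Y": "[CU]", "N": "[ACGU]"}


def consensus_to_regex(consensus: str) -> str:
    """Translate a consensus such as 'YNYURAC' into a regular expression."""
    return "".join(CODES[base] for base in consensus)


def find_sites(rna: str, consensus: str, offset: int) -> List[int]:
    """Return 0-based positions (match start + offset) for every match.

    A lookahead is used so that overlapping matches are also reported.
    """
    pattern = re.compile("(?=(" + consensus_to_regex(consensus) + "))")
    return [m.start() + offset for m in pattern.finditer(rna.upper())]


# Hypothetical toy pre-mRNA: exon / intron with branch point and U-rich tract / exon.
rna = "CCAG" + "GUAAGU" + "AAA" + "CUCUAAC" + "UUUUUUUUUU" + "CAG" + "GCC"

donors = find_sites(rna, "AGGURAGU", 2)    # AG/GURAGU: junction after the AG
branch_as = find_sites(rna, "YNYURAC", 5)  # index of the branch point adenosine
yag_sites = find_sites(rna, "YAG", 3)      # position just after each YAG

# Rule of thumb from the text: the first YAG downstream of the branch point
# is the most commonly used 3' splice site.
acceptor = next(p for p in yag_sites if p > branch_as[0])
print(donors, branch_as, yag_sites, acceptor)
```

In this toy sequence a YAG also occurs at the end of the upstream exon, which is why the selection step restricts candidates to positions downstream of the branch point adenosine.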

[0004] In most cases, the splicing reaction occurs within the same pre-mRNA 
molecule, which is termed cis-splicing. Splicing between two independently transcribed
pre-mRNAs is termed trans-splicing. Trans-splicing was first discovered in trypanosomes
(Sutton & Boothroyd, 1986, Cell 47:527; Murphy et al., 1986, Cell 47:517) and subsequently
in nematodes (Krause & Hirsh, 1987, Cell 49:753), flatworms (Rajkovic et al., 1990, Proc.
Natl. Acad. Sci. USA, 87:8879; Davis et al., 1995, J. Biol. Chem. 270:21813) and in plant
mitochondria (Malek et al., 1997, Proc. Natl. Acad. Sci. USA 94:553). In the parasite
Trypanosoma brucei, all mRNAs acquire a spliced leader (SL) RNA at their 5' termini by
trans-splicing. A 5' leader sequence is also trans-spliced onto the transcripts of some genes in
Caenorhabditis elegans. This mechanism is appropriate for adding a single common sequence to many
different transcripts.

[0005] The mechanism of trans-splicing, which is nearly identical to that of
conventional cis-splicing, proceeds via two phosphoryl transfer reactions. The first causes
the formation of a 2'-5' phosphodiester bond producing a 'Y'-shaped branched intermediate,
equivalent to the lariat intermediate in cis-splicing. The second reaction, exon ligation,
proceeds as in conventional cis-splicing. In addition, sequences at the 3' splice site and some
of the snRNPs which catalyze the trans-splicing reaction closely resemble their counterparts
involved in cis-splicing.

[0006] Trans-splicing may also refer to a different process, where an intron of one
pre-mRNA interacts with an intron of a second pre-mRNA, enhancing the recombination of
splice sites between two conventional pre-mRNAs. This type of trans-splicing was
postulated to account for transcripts encoding a human immunoglobulin variable region
sequence linked to the endogenous constant region in a transgenic mouse (Shimizu et
al., 1989, Proc. Nat'l. Acad. Sci. USA 86:8020). In addition, trans-splicing of c-myb
pre-RNA has been demonstrated (Vellard, M. et al., Proc. Natl. Acad. Sci., 1992 89:2511-2515)
and more recently, RNA transcripts from cloned SV40 trans-spliced to each other were
detected in cultured cells and nuclear extracts (Eul et al., 1995, EMBO J. 14:3226).
However, naturally occurring trans-splicing of mammalian pre-mRNAs is thought to be an
exceedingly rare event. 

[0007] In vitro trans-splicing has been used as a model system to examine the
mechanism of splicing by several groups (Konarska & Sharp, 1985, Cell 46:165-171;
Solnick, 1985, Cell 42:157; Chiara & Reed, 1995, Nature 375:510; Pasman and
Garcia-Blanco, 1996, Nucleic Acids Res. 24:1638). Reasonably efficient trans-splicing (30% of
cis-spliced analog) was achieved between RNAs capable of base pairing to each other; splicing
of RNAs not tethered by base pairing was further diminished by a factor of 10. Other
in vitro trans-splicing reactions not requiring obvious RNA-RNA interactions among the
substrates were observed by Chiara & Reed (1995, Nature 375:510), Bruzik J.P. & Maniatis,
T. (1992, Nature 360:692) and Bruzik J.P. and Maniatis, T., (1995, Proc. Natl. Acad. Sci.
USA 92:7056-7059). These reactions occur at relatively low frequencies and require
specialized elements, such as a downstream 5' splice site or exonic splicing enhancers.
[0008] In addition to splicing mechanisms involving the binding of multiple proteins 
to the precursor mRNA which then act to correctly cut and join RNA, a third mechanism 
involves cutting and joining of the RNA by the intron itself, by what are termed catalytic 
RNA molecules or ribozymes. The cleavage activity of ribozymes has been targeted to 
specific RNAs by engineering a discrete "hybridization" region into the ribozyme. Upon 
hybridization to the target RNA, the catalytic region of the ribozyme cleaves the target. It 
has been suggested that such ribozyme activity would be useful for the inactivation or 
cleavage of target RNA in vivo, such as for the treatment of human diseases characterized 
by production of foreign or aberrant RNA. The use of antisense RNA has also been
proposed as an alternative mechanism for targeting and destruction of specific RNAs. In 
such instances small RNA molecules are designed to hybridize to the target RNA and by 
binding to the target RNA prevent translation of the target RNA or cause destruction of the 
RNA through activation of nucleases. 

[0009] Until recently, the practical application of targeted trans-splicing to modify
specific target genes has been limited to group I ribozyme-based mechanisms. Using the
Tetrahymena group I ribozyme, targeted trans-splicing was demonstrated in E. coli
(Sullenger B.A. and Cech, T.R., 1994, Nature 341:619-622), in mouse fibroblasts (Jones,
J.T. et al., 1996, Nature Medicine 2:643-648), human fibroblasts (Phylactou, L.A. et al.,
Nature Genetics 18:378-381) and human erythroid precursors (Lan et al., 1998, Science
280:1593-1596). While many applications of targeted RNA trans-splicing driven by
modified group I ribozymes have been explored, targeted trans-splicing mediated by native
mammalian splicing machinery, i.e., spliceosomes, has not been previously reported. 

3. SUMMARY OF THE INVENTION 
[0010] The present invention relates to compositions and methods for generating novel 
nucleic acid molecules through spliceosome-mediated targeted trans-splicing. The
compositions of the invention include pre-trans-splicing molecules (hereinafter referred to
as "PTMs") designed to interact with a natural target pre-mRNA molecule (hereinafter
referred to as "pre-mRNA") and mediate a spliceosomal trans-splicing reaction resulting in
the generation of a novel chimeric RNA molecule (hereinafter referred to as "chimeric
RNA"). The methods of the invention encompass contacting the PTMs of the invention with
a natural target pre-mRNA under conditions in which a portion of the PTM is spliced to the
natural pre-mRNA to form a novel chimeric RNA. The PTMs of the invention are
genetically engineered so that the novel chimeric RNA resulting from the trans-splicing
reaction may itself perform a function such as inhibiting the translation of RNA, or
alternatively, the chimeric RNA may encode a protein that complements a defective or
inactive protein in the cell, or encodes a toxin which kills the specific cells. Generally, the
target pre-mRNA is chosen because it is expressed within a specific cell type thereby
providing a means for targeting expression of the novel chimeric RNA to a selected cell type.
The target cells may include, but are not limited to those infected with viral or other
infectious agents, benign or malignant neoplasms, or components of the immune system
which are involved in autoimmune disease or tissue rejection. The PTMs of the invention
may also be used to correct genetic mutations found to be associated with genetic diseases.
In particular, double-trans-splicing reactions can be used to replace internal exons. The
PTMs of the invention can also be genetically engineered to tag exon sequences in an mRNA
molecule as a method for identifying intron/exon boundaries in target pre-mRNA. The
invention further relates to the use of PTM molecules that are genetically engineered to 
encode a peptide affinity purification tag for use in the purification and identification of 
proteins expressed in a specific cell type. The methods and compositions of the invention 
can be used in gene regulation, gene repair and targeted cell death. Such methods and 
compositions can be used for the treatment of various diseases including, but not limited to, 
genetic, infectious or autoimmune diseases and proliferative disorders such as cancer and to 
regulate gene expression in plants. 

4. BRIEF DESCRIPTION OF THE DRAWINGS 
[0011] Figure 1A. Model of Pre-Trans-splicing RNA.

[0012] Figure 1B. Model PTM constructs and targeted trans-splicing strategy.
Schematic representation of the first generation PTMs (PTM+Sp and PTM-Sp). BD, binding
domain; NBD, non-binding domain; BP, branch point; PPT, pyrimidine tract; ss, splice site
and DT-A, diphtheria toxin subunit A. Unique restriction sites within the PTMs are
indicated by single letters: E, EcoRI; X, XhoI; K, KpnI; P, PstI; A, AccI; B, BamHI and H,
HindIII.

[0013] Figure 1C. Schematic drawing showing the binding of PTM+Sp via
conventional Watson-Crick base pairing to the βHCG6 target pre-mRNA and the proposed
cis- and trans-splicing mechanism.

[0014] Figure 2A. In vitro trans-splicing efficiency of various PTM constructs into
βHCG6 target. A targeted binding domain and active splice sites correlate with PTM
trans-splicing activity. Full length targeted (pcPTM+Sp), non-targeted (PTM-Sp) and the splice
mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] PTM RNAs were added to splicing reactions
containing βHCG6 target pre-mRNA. The products were RT-PCR amplified using primers
βHCG-F (specific for target βHCG6 exon 1) and DT-5R (complementary to DT-A) and
analyzed by electrophoresis in a 1.5% agarose gel.

[0015] Figure 2B. In vitro trans-splicing efficiency of various PTM constructs. Full 
length PTM with a spacer between the binding domain and splice site (PTM+Sp), PTM 
without the spacer region (PTM+) and short PTMs that contain a target binding domain 
(short PTM+) or a non-target binding region (PTM-) were added to splicing reactions 
containing βHCG target pre-mRNA. The products were RT-PCR amplified using primers
βHCG-F and DT-3. For reactions containing the short PTMs, the reverse PCR primer was
DT-4, since the binding site for DT-3 was removed from the PTM.

[0016] Figure 3. Nucleotide sequence demonstrating the in vitro trans-spliced product 
between a PTM and target pre-mRNA. The 466 bp trans-spliced RT-PCR product from
Figure 2 (lane 2) was re-amplified using a 5' biotin labeled forward primer (βHCG-F) and
a nested unlabeled reverse primer (DT-3R). Single stranded DNA was purified and
sequenced directly using toxin specific DT-3R primer. The arrow indicates the splice
junction between the last nucleotide of target βHCG6 exon 1 and the first nucleotide
encoding DT-A. 

[0017] Figure 4A. Schematic diagram of the "safety" PTM and variations, 
demonstrating the PTM intramolecular base-paired stem, intended to mask the BP and PPT 
from splicing factors. Underlined sequences represent the βHCG6 intron 1 complementary
target-binding domain; sequences in italics indicate target mismatches that are homologous
to the BP. 

[0018] Figure 4B. Schematic of a safety PTM in open configuration upon binding to
the target. 

[0019] Figure 4C. In vitro trans-splicing reactions were carried out by incubating
either safety PTM or safety PTM variants with the βHCG6 target. Splicing reactions were
amplified by RT-PCR using βHCG-F and DT-3R primers; products were analyzed in a 2.0%
agarose gel. 

[0020] Figure 5. Specificity of targeted trans-splicing is enhanced by the inclusion of
a safety in the PTM. βHCG6 pre-mRNA (250 ng) and β-globin pre-mRNA (250 ng) were
annealed together with either PTM+SF (safety) or pcPTM+Sp (linear) RNA (500 ng). In
vitro trans-splicing reactions and RT-PCR analysis were performed as described under
experimental procedures and the products were separated on a 2.0% agarose gel. Primers 
used for RT-PCR are as indicated.

[0021] Figure 6. In the presence of increasing PTM concentration, cis-splicing is
inhibited and replaced by trans-splicing. In vitro splicing reactions were performed in the
presence of a constant amount of βHCG6 target pre-mRNA (100 ng) with increasing
concentrations of PTM (pcPTM+Sp) RNA (52-300 ng). RT-PCR for cis-spliced and
un-spliced products utilized primers βHCG-F (exon 1 specific) and βHCG-R2 (exon 2 specific
- Panel A); primers βHCG-F and DT-3R were used to RT-PCR trans-spliced products (Panel
B). Reaction products were analyzed on 1.5% and 2.0% agarose gels, respectively. In panel
A, lane 9 represents the 60 min time point in the presence of 300 ng of PTM, which is 
equivalent to lane 10 in panel B. 

[0022] Figure 7A. PTMs are capable of trans-splicing in cultured human cancer cells.
Total RNA was isolated from each of 4 expanded neomycin resistant H1299 lung carcinoma
colonies transfected with pcSp+CRM (expressing non-toxic mutant DT-A). RT-PCR was
performed using 1 µg of total RNA and 5' biotinylated βHCG-F and non-biotinylated DT-3R
primers. Single stranded DNA was purified and sequenced. 

[0023] Figure 7B. Nucleotide sequence (sense strand) of the trans-spliced product
between endogenous βHCG6 target and CRM197 mutant toxin is shown. Two arrows
indicate the position of the splice junction. 

[0024] Figure 8A. Schematic diagram of a double splicing pre-therapeutic mRNA. 
[0025] Figure 8B. Selective trans-splicing of a double splicing PTM. By varying the
PTM concentration the PTM can be trans-spliced into either the 5' or the 3' splice site of the
target. 

[0026] Figure 9. Schematic diagram of the use of PTM molecules for exon tagging.
Two examples of PTMs are shown. The PTM on the left is capable of non-specifically
trans-splicing into a target pre-mRNA 3' splice site. The other PTM on the right is designed
to non-specifically trans-splice into a target pre-mRNA 5' splice site. A PTM mediated
trans-splicing reaction will result in the production of a chimeric RNA comprising a specific
tag to either the 5' or 3' side of an authentic exon.

[0027] Figure 10A. Schematic diagram of constructs for use in the lacZ knock-out 
model. The target lacZ pre-mRNA contains the 5' fragment of lacZ followed by βHCG6
intron 1 and the 3' fragment of lacZ (target 1). The PTM molecule for use in the model
system was created by digesting pPTM +SP with PstI and HindIII and replacing the DT-A
toxin with βHCG6 exon 2 (pc3.1PTM2).

[0028] Figure 10B. Schematic diagram of restoration of β-Gal activity by Spliceosome
Mediated RNA Trans-splicing. Schematic diagram of constructs for use in the lacZ
knock-in model (pc.3.1 lacZ T2). The lacZ target pre-mRNA is identical to that target
pre-mRNA used for the knock-out experiments except that it contains two stop codons (TAA
TAA) in frame four codons after the 3' splice site. The PTM molecule for use in the model
system was created by digesting pPTM +SP with PstI and HindIII and replacing the DT-A
toxin with the functional 3' fragment of lacZ.

[0029] Figure 11A. Demonstration of cis- and trans-splicing when utilizing the lacZ
knock-out model. The LacZ splice target 1 pre-mRNA and PTM2 were co-transfected into
293T cells. Total RNA was then isolated and analyzed by PCR for cis-spliced and
trans-spliced products using the appropriate specific primers. The amplified PCR products were
separated on a 2% agarose gel.

[0030] Figure 11B-C. Assays for β-galactosidase activity. 293 cells were transfected
with lacZ target 2 DNA alone (panel B) or lacZ target 2 DNA and PTM1 (panel C). 
[0031] Figure 12A. Nucleotide sequence of trans-spliced molecule demonstrating 
accurate trans-splicing. 

[0032] Figure 12B. Nucleotide sequences of the cis-spliced product and the
trans-spliced product. The nucleotide sequences were those sequences expected for each of the
different splicing reactions. 

[0033] Figure 13. Gene repair model for repair of the cystic fibrosis transmembrane
conductance regulator (CFTR) gene.

[0034] Figure 14. RT-PCR demonstration of trans-splicing between an exogenously
supplied CFTR mini-gene target and PTM. Plasmids were co-transfected into 293
embryonic kidney cells. The primer pairs used for RT-PCR reactions are listed above each
lane. The lower band (471 bp) in each lane represents a trans-spliced product. The lower
band in lane 1 (471 bp) was purified from a 2% Seakem agarose gel and the DNA sequence
of the band was determined. 

[0035] Figure 15. DNA sequence of the trans-spliced product (lane 1, lower band
shown in Figure 14). The DNA sequence indicates the presence of the F508 codon (CTT),
exon 9 sequence contiguous with exon 10 sequence, and the His tag sequence.
[0036] Figure 16. Schematic representation of repair of an exogenously supplied 
CFTR target molecule carrying an F508 deletion in exon 10. 

[0037] Figure 17. Repair of endogenous CFTR transcripts by exon 10 replacement 
using a double splicing PTM. The use of a double splicing PTM permits repair of the Δ508
mutation with a very short PTM molecule.

[0038] Figure 18. Model lacZ target consisting of lacZ 5' exon - CFTR mini-intron 9
- CFTR exon 10 (delta 508) - CFTR mini-intron 10 followed by the lacZ 3' exon. Binding
domains for PTMs are bracketed. 

[0039] Figure 19. Schematic representation of double-trans-splicing PTMs designed
to restore β-gal function.

[0040] Figure 20. Schematic representation of a double-trans-splicing reaction
showing the binding of DSPTM7 with DSCFT1.6 target pre-mRNA.

[0041] Figure 21. Important structural elements of DSPTM7. The double splicing
PTM has both 3' and 5' functional splice sites as well as binding domains.

[0042] Figure 22. Schematic diagram of mutant double splicing PTMs.

[0043] Figure 23. Accuracy of double-trans-splicing reaction.

[0044] Figure 24. Double-trans-splicing between the target pre-mRNA and the
DSPTM7 produces full-length protein. Western blot analysis of total cell lysates using
polyclonal anti-β-galactosidase antiserum.

[0045] Figure 25. Precise internal exon substitution between the DSCFT1.6 target
pre-mRNA and DSPTM7 RNA by double-trans-splicing produces functionally active β-gal
protein. Total cell extracts were prepared and assayed for β-gal activity using an ONPG
assay. 

[0046] Figure 26. 3' and 5' splice sites are essential for the restoration of β-gal function
by double-trans-splicing reaction.

[0047] Figure 27. Double-trans-splicing: titration of target and PTM. Different
concentrations of the target and PTM were co-transfected and analyzed for β-gal activity
restoration. 

[0048] Figure 28. Constructs designed to test the specificity of double-trans-splicing
reaction.

[0049] Figure 29. Specificity of a double-trans-splicing reaction.

[0050] Figure 30. Trans-splicing repair of the cystic fibrosis gene using a PTM that
mediates a double-trans-splicing event.

[0051] Figure 31. PTM with a long binding domain masking two splice sites and part
of exon 10 in a mini-gene target. 

[0052] Figure 32. Sequence of a single PCR product showing target exon 9 correctly 
spliced to PTM exon 10 (with modified codons) (upper panel), codon 508 in exon 10 of the 
PTM (middle panel) and PTM exon 10 correctly spliced to target exon 11 (lower panel). The
sequence of a repaired target was generated by RT-PCR followed by PCR.

[0053] Figure 33. Trans-splicing repair of the cystic fibrosis gene using a PTM that
can perform 5' exon replacement.

[0054] Figure 34. Schematic diagram of three different PTM molecules with different 
binding domains. 

[0055] Figure 35. Schematic diagram of PTM exon 10 with modified codon usage to
reduce antisense effects with its own binding domain.

[0056] Figure 36. Sequence of cis- and trans-spliced products. 

[0057] Figure 37. Model system for repair of messenger RNAs by trans-splicing.
(A) Schematic illustration of a defective lacZCF9m splice target used in the present study
(see Materials and Methods for details). BP, branch point; PPT, polypyrimidine tracts; ss,
splice sites and pA, polyadenylation signal. (B) A prototype PTM showing the key
components of the trans-splicing domain, and the diagrams of various PTMs showing the
binding domain length and approximate positions at which they bind to the target
pre-mRNA. Unique restriction sites within the trans-splicing domain are N, Nhe I; S, Sac II; K,
Kpn I and E, EcoR V. (C) Schematic diagram showing the binding of a PTM through
antisense binding and repair of defective lacZ pre-mRNA through targeted RNA
trans-splicing. Expected cis- and trans-spliced products and the primer binding sites for Lac-9F,
Lac-3R and Lac-5R are indicated. 

[0058] Figure 38. Efficient repair of lacZ messenger RNA. Target specific primers,
Lac-9F (5' exon) and Lac-3R (3' exon) were used to amplify cis-spliced products (lanes 1-6),
while target and PTM specific primers, Lac-9F (5' exon) and Lac-5R (3' exon) were used
to amplify trans-spliced products (lanes 7-15). 25-50 ng of total RNA was used to measure
target cis-splicing (lanes 1-6) and 50-200 ng of total RNA was used to measure PTM induced
RNA trans-splicing (lanes 7-12). Lanes 13-15, 25-50 ng of total RNA from cells transfected
with lacZCF9, a control for trans-splicing. (B) Endogenous mRNA repair by trans-splicing.
Lanes 1-3, RNA from cells transfected with PTM-CF14; lanes 4-6, PTM-CF22 and lanes 
7-9, PTM-CF24. Lane 10, RNA from mock-transfected cells and lane 11 is a control in 
which reverse-transcription reaction was omitted. 

[0059] Figure 39. Messenger RNA repair leads to synthesis of full-length 
β-galactosidase. Lane 1, lacZCF9 (positive control, 5 µg); lane 2, lacZCF9m target alone
(25 µg); lane 3, PTM-CF24 alone (25 ng) and lane 4, lacZCF9m target + PTM-CF24
(25 µg).

[0060] Figure 40. Messenger RNA repair by SMaRT produces functional
β-galactosidase. (A) In situ detection of functional β-galactosidase produced by
trans-splicing. 293T cells were either transfected (transient assay) with lacZCF9m target alone
(panel A) or co-transfected with lacZCF9m target + PTM-CF24 (panel B) expression
plasmids as described above. 48 hr post-transfection, cells were rinsed with PBS and stained
in situ for β-gal activity. (B) Repair of a defective lacZ mRNA produces functional
β-galactosidase. Target and PTM, extracts from cells transfected with either lacZCF9m target
or PTM-CF24 plasmid alone, and the rest were from cells co-transfected with lacZCF9m
target and one of the PTMs as indicated. (C) Endogenous mRNA repair by trans-splicing
produces functional β-galactosidase. Stable cells expressing an endogenous lacZCF9m
pre-mRNA target were transfected with "linear" PTMs (PTM-CF14, PTM-CF22 or PTM-CF24)
as described above. Following transfection, total cell lysate was prepared and assayed for
β-gal activity. The results presented are the average of two independent transfections.
[0061] Figure 41. Messenger RNA repair is specific. (A) Experimental strategy to
measure non-specific trans-splicing between lacZHCG1m pre-mRNA and "linear" PTMs.
(B) Extended binding domains enhance the specificity of trans-splicing. Lanes 1-3,
PTM-CF14; 4-6, PTM-CF22; 7-9, PTM-CF24; 10-12, PTM-CF26 and 13-15, PTM-CF27. (C)
PTMs with very long binding domains are capable of increasing specificity. Total cell
extract (5 µl) was assayed in solution for β-gal activity and the specific activity was
calculated. β-gal activity was normalized to mock and the results presented are the average
of two independent transfections. Control, extract from cells transfected with lacZHCG1m
target alone and the rest were co-transfected with lacZHCG1m target and one of the linear
PTMs. 

[0062] Figure 42. Complete sequence of CFTR PTM 30 (5' exon replacement PTM) 
showing the trans-splicing domain (underlined) and the coding sequence for exons 1-10 of 
the CFTR gene. Modified codons in exon 10 are underlined and bold. 
[0063] Figure 43A. 153 base-pair PTM 24 Binding Domain.
[0064] Figure 43B. Complete sequence of CFTR PTM 24 (3' exon replacement PTM)
showing the trans-splicing domain (underlined) and the coding sequence for exons 10-24 of
the CFTR cDNA. At the end of the coding sequence are a histidine tag and the translation stop codon.
[0065] Figure 44A. Detailed structure of the mouse factor VIII PTM containing
normal mouse sequences for exons 16-26. BGH=bovine growth hormone 3' UTR
(untranslated sequence); Binding Domain=125 bp; base changes to eliminate cryptic sites are
circled; F5, F6, F7, F8=primer sites.

[0066] Figure 44B. Schematic diagram showing the extent of the binding domain in 
the mouse factor VIII gene. 

[0067] Figure 44C. Changes to the promoter in AAV vectors pDLZ20 and pDLZ20- 
M2 to eliminate cryptic donor sites in sequence upstream of the PTM binding domain. 
[0068] Figure 44D. Factor VIII repair model. Schematic diagram of a PTM binding 
to the 3' splice site of intron 15 of the mouse factor VIII gene. 

[0069] Figure 45. Schematic diagram of a F8 PTM with the trans-splicing domain
eliminated. This represents a control PTM to test whether repair is a result of trans-splicing.
[0070] Figure 46. Data indicating repair of factor VIII in Factor VIII knock out mice.
Blood was assayed for factor VIII activity using a Coatest assay.

[0071] Figure 47 A. Detailed structure of a mouse factor VIII PTM containing normal 
sequences for exons 16-26 and a C-terminal FLAG tag. BGH=bovine growth hormone
3' UTR; Binding domain=125 bp.

[0072] Figure 47B. Detailed structure of a human or canine factor VIII PTM
containing normal sequences for exons 23-26. 

5. DETAILED DESCRIPTION OF THE INVENTION 
[0073] The present invention relates to compositions comprising pre-trans-splicing
molecules (PTMs) and the use of such molecules for generating novel nucleic acid
molecules. The PTMs of the invention comprise one or more target binding domains that
are designed to specifically bind to pre-mRNA; a 3' splice region that includes a branch
point, pyrimidine tract and a 3' splice acceptor site and/or a 5' splice donor site; and one or
more spacer regions that separate the RNA splice site from the target binding domain. In
addition, the PTMs of the invention can be engineered to contain any nucleotide sequences 
such as those encoding a translatable protein product. 

[0074] The methods of the invention encompass contacting the PTMs of the invention
with a natural pre-mRNA under conditions in which a portion of the PTM is trans-spliced
to a portion of the natural pre-mRNA to form a novel chimeric RNA. The target pre-mRNA
is chosen as a target due to its expression within a specific cell type, thus providing a
mechanism for targeting expression of a novel RNA to a selected cell type. The resulting
chimeric RNA may provide a desired function, or may produce a gene product in the specific
cell type. The specific cells may include, but are not limited to, those infected with viral or
other infectious agents, benign or malignant neoplasms, or components of the immune
system which are involved in autoimmune disease or tissue rejection. Specificity is achieved
by modification of the binding domain of the PTM to bind to the target endogenous pre-
mRNA. The gene products encoded by the chimeric RNA can be the product of any gene,
including genes having clinical usefulness, for example, therapeutic or marker genes, and
genes encoding toxins.

5.1. STRUCTURE OF THE PRE-TRANS-SPLICING MOLECULES
[0075] The present invention provides compositions for use in generating novel 
chimeric nucleic acid molecules through targeted trans-splicing. The PTMs of the invention 
comprise (i) one or more target binding domains that target binding of the PTM to a pre-
mRNA; (ii) a 3' splice region that includes a branch point, pyrimidine tract and a 3' splice
acceptor site and/or 5' splice donor site; and (iii) one or more spacer regions to separate the
RNA splice site from the target binding domain. Additionally, the PTMs can be engineered 
to contain any nucleotide sequence encoding a translatable protein product. In yet another 
embodiment of the invention, the PTMs can be engineered to contain nucleotide sequences 
that inhibit the translation of the chimeric RNA molecule. For example, the nucleotide 
sequences may contain translational stop codons or nucleotide sequences that form secondary 
structures and thereby inhibit translation. Alternatively, the chimeric RNA may function as 
an antisense molecule thereby inhibiting translation of the RNA to which it binds. 
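
The arrangement of elements described above can be sketched as a small data structure. The following Python example is illustrative only: it assumes a 3' exon replacement PTM laid out 5' to 3' as binding domain, spacer, branch point, pyrimidine tract, acceptor site and coding sequence, and every default sequence in it (spacer, branch point, tract, acceptor) is a hypothetical placeholder rather than a sequence taken from this specification.

```python
from dataclasses import dataclass

# RNA complement table; sequences are assumed to be upper-case A/C/G/U.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense(rna: str) -> str:
    """Return the reverse complement (antisense strand) of an RNA sequence."""
    return rna.translate(_COMPLEMENT)[::-1]


@dataclass(frozen=True)
class ThreePrimePTM:
    """Toy model of a 3' exon replacement PTM, written 5' to 3'."""
    binding_domain: str    # antisense to the targeted pre-mRNA region
    spacer: str            # separates the binding domain from the splice site
    branch_point: str
    pyrimidine_tract: str
    acceptor: str          # ends in the AG dinucleotide
    coding: str            # sequence to be trans-spliced into the target

    def sequence(self) -> str:
        return (self.binding_domain + self.spacer + self.branch_point
                + self.pyrimidine_tract + self.acceptor + self.coding)


def design_ptm(target_region: str, coding: str,
               spacer: str = "GAGAACAUUAUU",        # hypothetical spacer
               branch_point: str = "UACUAAC",        # fits YNYURAC
               pyrimidine_tract: str = "UUUUUUUUUU",  # hypothetical tract
               acceptor: str = "CAG") -> ThreePrimePTM:
    """Build a PTM whose binding domain is antisense to target_region."""
    return ThreePrimePTM(antisense(target_region), spacer, branch_point,
                         pyrimidine_tract, acceptor, coding)


ptm = design_ptm("GGGAAACCC", "AUGGCC")
print(ptm.binding_domain)  # GGGUUUCCC
print(ptm.sequence())
```

A real design would choose the binding domain length and spacer content empirically; the sketch only shows how the named elements fit together.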
[0076] The target binding domain of the PTM may contain multiple binding domains
which are complementary to and in anti-sense orientation to the targeted region of the
selected pre-mRNA. As used herein, a target binding domain is defined as any sequence that 
confers specificity of binding and anchors the pre-mRNA closely in space so that the 
spliceosome processing machinery of the nucleus can trans-splice a portion of the PTM to 
a portion of the pre-mRNA. The target binding domains may comprise up to several 
thousand nucleotides. In preferred embodiments of the invention the binding domains may 
comprise at least 10 to 30 and up to several hundred nucleotides. As demonstrated herein, 
the specificity of the PTM can be increased significantly by increasing the length of the 
target binding domain. For example, the target binding domain may comprise several 
hundred nucleotides or more. In addition, although the target binding domain may be 
"linear" it is understood that the RNA may fold to form secondary structures that may 
stabilize the complex thereby increasing the efficiency of splicing. A second target binding 
region may be placed at the 3' end of the molecule and can be incorporated into the PTM of 
the invention. Absolute complementarity, although preferred, is not required. A sequence 
"complementary" to a portion of an RNA, as referred to herein, means a sequence having 
sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex. 
The ability to hybridize will depend on both the degree of complementarity and the length 
of the nucleic acid (See, for example, Sambrook et al., 1989, Molecular Cloning, A
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New
York). Generally, the longer the hybridizing nucleic acid, the more base mismatches with
an RNA it may contain and still form a stable duplex. One skilled in the art can ascertain a
tolerable degree of mismatch or length of duplex by use of standard procedures to determine
the stability of the hybridized complex.

[0077] Where the PTMs are designed for use in intron-exon tagging or for peptide 
affinity tagging, a library of PTMs is genetically engineered to contain random nucleotide 
sequences in the target binding domain. Alternatively, for intron-exon tagging the PTMs 
may be genetically engineered so as to lack target binding domains. The goal of generating 
such a library of PTM molecules is that the library will contain a population of PTM 
molecules capable of binding to each RNA molecule expressed in the cell. A recombinant 
expression vector can be genetically engineered to contain a coding region for a PTM 
including a restriction endonuclease site that can be used for insertion of random DNA 
fragments into the PTM to form random target binding domains. The random nucleotide 
sequences to be included in the PTM as target binding domains can be generated using a 
variety of different methods well known to those of skill in the art, including but not limited 
to, partial digestion of DNA with restriction enzymes or mechanical shearing of DNA to 
generate random fragments of DNA. Random binding domain regions may also be 
generated by degenerate oligonucleotide synthesis. The degenerate oligonucleotides can be 
engineered to have restriction endonuclease recognition sites on each end to facilitate cloning 
into a PTM molecule for production of a library of PTM molecules having degenerate 
binding domains. 
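
As a rough illustration of the degenerate-oligonucleotide approach described above, the following Python sketch expands a degenerate DNA oligonucleotide written in standard IUPAC codes, counts the size of the resulting library, and flanks the degenerate core with restriction sites for cloning. The choice of EcoRI (GAATTC) and BamHI (GGATCC) sites and the function names are assumptions made for the example; the specification does not name particular enzymes.

```python
from itertools import product

# Standard IUPAC nucleotide codes for DNA.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def library_size(oligo: str) -> int:
    """Number of distinct sequences encoded by a degenerate oligo."""
    size = 1
    for code in oligo:
        size *= len(IUPAC[code])
    return size


def expand(oligo: str) -> list[str]:
    """Enumerate every concrete sequence (only practical for short oligos)."""
    return ["".join(bases) for bases in product(*(IUPAC[c] for c in oligo))]


def add_cloning_sites(core: str, left: str = "GAATTC",
                      right: str = "GGATCC") -> str:
    """Flank a degenerate core with restriction sites (illustrative choice)."""
    return left + core + right


print(library_size("N" * 20))        # 4**20 possible 20-mer binding domains
print(expand("RY"))                  # ['AC', 'AT', 'GC', 'GT']
print(add_cloning_sites("N" * 20))
```

The library size grows as 4 to the power of the number of fully degenerate positions, which is why such libraries are sampled rather than enumerated in practice.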

[0078] Binding may also be achieved through other mechanisms, for example, through 
triple helix formation or protein/nucleic acid interactions such as those in which the PTM is 
engineered to recognize a specific RNA binding protein, i.e., a protein bound to a specific 
target pre-mRNA. Alternatively, the PTMs of the invention may be designed to recognize
secondary structures, such as, for example, hairpin structures resulting from intramolecular
base pairing between nucleotides within an RNA molecule.

[0079] The PTM molecule also contains a 3' splice region that includes a branch point,
pyrimidine tract and a 3' splice acceptor AG site and/or a 5' splice donor site. Consensus
sequences for the 5' splice donor site and the 3' splice region used in RNA splicing are well
known in the art (See, Moore, et al., 1993, The RNA World, Cold Spring Harbor Laboratory
Press, p. 303-358). In addition, modified consensus sequences that maintain the ability to
function as 5' donor splice sites and 3' splice regions may be used in the practice of the
invention. Briefly, the 5' splice site consensus sequence is AG/GURAGU (where
A=adenosine, U=uracil, G=guanine, C=cytosine, R=purine and /=the splice site). The 3'
splice site consists of three separate sequence elements: the branch point or branch site, a
polypyrimidine tract and the 3' consensus sequence (YAG). The branch point consensus
sequence in mammals is YNYURAC (Y=pyrimidine, N=any nucleotide). The A in this
sequence is the site of branch formation. A polypyrimidine tract is located between the
branch point and the splice site acceptor and is important for different branch point
utilization and 3' splice site recognition.
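
The consensus sequences given above can be expressed as simple pattern searches. The Python sketch below is a minimal illustration, not a splice site predictor: it treats the consensus sequences as exact patterns, whereas real splice sites tolerate deviations. The function names and the test sequence are hypothetical.

```python
import re

# 5' splice site consensus AG/GURAGU: match the exonic AG only when it is
# followed by GURAGU, so the match ends exactly at the splice site (/).
DONOR = re.compile(r"AG(?=GU[AG]AGU)")

# Branch point consensus YNYURAC; a lookahead allows overlapping hits.
BRANCH = re.compile(r"(?=[CU][ACGU][CU]U[AG]AC)")

# 3' splice site consensus YAG.
ACCEPTOR = re.compile(r"[CU]AG")


def donor_sites(rna: str) -> list[int]:
    """Index of the first intronic base after each consensus 5' splice site."""
    return [m.end() for m in DONOR.finditer(rna)]


def branch_points(rna: str) -> list[int]:
    """Index of the branch-forming A (sixth base of YNYURAC) for each hit."""
    return [m.start() + 5 for m in BRANCH.finditer(rna)]


def first_acceptor_after(rna: str, start: int):
    """Index just past the first YAG found at or after start, else None."""
    m = ACCEPTOR.search(rna, start)
    return m.end() if m else None


# Hypothetical mini pre-mRNA: exon, short intron, start of the next exon.
rna = "CAGGUAAGUAUCCUCUAACUUUUUUUUUUCCAGGA"
print(donor_sites(rna))                  # [3]
bp = branch_points(rna)
print(bp)                                # [17]
print(first_acceptor_after(rna, bp[0]))  # 33
```

In this toy sequence the pyrimidine-rich stretch lies between the branch point A at index 17 and the acceptor AG, matching the arrangement described in the text.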

[0080] Further, PTMs comprising a 3' acceptor site (AG) may be genetically
engineered. Such PTMs may further comprise a pyrimidine tract and/or branch point
sequence.

[0081] Recently, pre-messenger RNA introns beginning with the dinucleotide AU and 
ending with the dinucleotide AC have been identified and referred to as U12 introns. U12 
intron sequences as well as any sequences that function as splice acceptor/donor sequences 
may also be used in PTMs.

[0082] A spacer region to separate the RNA splice site from the target binding domain
is also included in the PTM. The spacer region can have features such as stop codons which 
would block any translation of an unspliced PTM and/or sequences that enhance trans- 
splicing to the target pre-mRNA. 

[0083] In a preferred embodiment of the invention, a "safety" is also incorporated into
the spacer, binding domain, or elsewhere in the PTM to prevent non-specific trans-splicing.
This is a region of the PTM that covers elements of the 3' and/or 5' splice site of the PTM
by relatively weak complementarity, preventing non-specific trans-splicing. The PTM is
designed in such a way that upon hybridization of the binding/targeting portion(s) of the
PTM, the 3' and/or 5' splice site is uncovered and becomes fully active.

[0084] The "safety" consists of one or more complementary stretches of cis-sequence
(or could be a second, separate, strand of nucleic acid) which weakly binds to one or both
sides of the PTM branch point, pyrimidine tract, 3' splice site and/or 5' splice site (splicing
elements), or could bind to parts of the splicing elements themselves. This "safety" binding
prevents the splicing elements from being active (i.e., it blocks U2 snRNP or other splicing
factors from attaching to the PTM splice site recognition elements). The binding of the
"safety" may be disrupted by the binding of the target binding region of the PTM to the
target pre-mRNA, thus exposing and activating the PTM splicing elements (making them
available to trans-splice into the target pre-mRNA).

[0085] A nucleotide sequence encoding a translatable protein capable of producing an 
effect, such as cell death, or alternatively, one that restores a missing function or acts as a 
marker, is included in the PTM of the invention. For example, the nucleotide sequence can
include those sequences encoding gene products missing or altered in known genetic
diseases. Alternatively, the nucleotide sequences can encode marker proteins or peptides
which may be used to identify or image cells. In yet another embodiment of the invention,
nucleotide sequences encoding affinity tags such as HIS tags (6 consecutive histidine
residues) (Janknecht, et al., 1991, Proc. Natl. Acad. Sci. USA 88:8972-8976), the C-terminus 
of glutathione-S-transferase (GST) (Smith and Johnson, 1986, Proc. Natl. Acad. Sci. USA 
83:8703-8707) (Pharmacia) or FLAG (Asp-Tyr-Lys-Asp-Asp-Asp-Lys) (Eastman 
Kodak/IBI, Rochester, NY) can be included in PTM molecules for use in affinity 
purification. The use of PTMs containing such nucleotide sequences results in the 
production of a chimeric RNA encoding a fusion protein containing peptide sequences 
normally expressed in a cell linked to the peptide affinity tag. The affinity tag provides a 
method for the rapid purification and identification of peptide sequences expressed in the 
cell. In a preferred embodiment the nucleotide sequences may encode toxins or other
proteins which provide some function which enhances the susceptibility of the cells to 
subsequent treatments, such as radiation or chemotherapy. 

[0086] In a highly preferred embodiment of the invention a PTM molecule is designed 
to contain nucleotide sequences encoding the Diphtheria toxin subunit A (Greenfield, L., et
al., 1983, Proc. Natl. Acad. Sci. USA 80:6853-6857). Diphtheria toxin subunit A contains
enzymatic toxin activity and will function if expressed or delivered into human cells,
resulting in cell death. Furthermore, various other known peptide toxins may be used in the
present invention, including but not limited to, ricin, Pseudomonas toxin, Shiga toxin and
exotoxin A.

[0087] Additional features can be added to the PTM molecule either after, or before,
the nucleotide sequence encoding a translatable protein, such as polyadenylation signals or 
5' splice sequences to enhance splicing, additional binding regions, "safety"-self 
complementary regions, additional splice sites, or protective groups to modulate the stability 
of the molecule and prevent degradation. 

[0088] Additional features that may be incorporated into the PTMs of the invention 
include stop codons or other elements in the region between the binding domain and the 
splice site to prevent unspliced pre-mRNA expression. In another embodiment of the 
invention, PTMs can be generated with a second anti-sense binding domain downstream 
from the nucleotide sequences encoding a translatable protein to promote binding to the 3' 
target intron or exon and to block the fixed authentic cis-5' splice site (U5 and/or U1 binding
sites). 

[0089] PTMs may also be generated that require a double-trans-splicing reaction for
generation of a chimeric trans-spliced product. Such PTMs could be used to replace an
internal exon which could be used for RNA repair. PTMs designed to promote two trans-
splicing reactions are engineered as described above; however, they contain both 5' donor
sites and 3' splice acceptor sites. In addition, the PTMs may comprise two or more binding
domains and spacer regions. The spacer regions may be placed between the multiple binding
domains and splice sites or alternatively between the multiple binding domains.

[0090] Further elements such as a 3' hairpin structure, circularized RNA, nucleotide
base modification, or a synthetic analog can be incorporated into PTMs to promote or
facilitate nuclear localization and spliceosomal incorporation, and intra-cellular stability.

[0091] Additionally, when engineering PTMs for use in plant cells it may not be
necessary to include conserved branch point sequences or polypyrimidine tracts as these
sequences may not be essential for intron processing in plants. However, a 3' splice acceptor
site and/or 5' splice donor site, such as those required for splicing in vertebrates and yeast,
will be included. Further, the efficiency of splicing in plants may be increased by also
including UA-rich intronic sequences. The skilled artisan will recognize that any sequences
that are capable of mediating a trans-splicing reaction in plants may be used.

[0092] The PTMs of the invention can be used in methods designed to produce a novel 
chimeric RNA in a target cell. The methods of the present invention comprise delivering to 
the target cell a PTM which may be in any form used by one skilled in the art, for example, 
an RNA molecule, or a DNA vector which is transcribed into a RNA molecule, wherein said 
PTM binds to a pre-mRNA and mediates a trans-splicing reaction resulting in formation of
a chimeric RNA comprising a portion of the PTM molecule spliced to a portion of the pre- 
mRNA. 



5.2. SYNTHESIS OF THE TRANS-SPLICING MOLECULES
[0093] The nucleic acid molecules of the invention can be RNA or DNA or derivatives 
or modified versions thereof, single-stranded or double-stranded. By nucleic acid is meant 
a PTM molecule or a nucleic acid molecule encoding a PTM molecule, whether composed 
of deoxyribonucleotides or ribonucleosides, and whether composed of phosphodiester 
linkages or modified linkages. The term nucleic acid also specifically includes nucleic acids 
composed of bases other than the five biologically occurring bases (adenine, guanine,
thymine, cytosine and uracil).

[0094] The RNA and DNA molecules of the invention can be prepared by any method 
known in the art for the synthesis of DNA and RNA molecules. For example, the nucleic 
acids may be chemically synthesized using commercially available reagents and synthesizers 
by methods that are well known in the art (see, e.g., Gait, 1985, Oligonucleotide Synthesis:
A Practical Approach, IRL Press, Oxford, England). Alternatively, RNA molecules can be 
generated by in vitro and in vivo transcription of DNA sequences encoding the RNA 
molecule. Such DNA sequences can be incorporated into a wide variety of vectors which 
incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase 
promoters. RNAs may be produced in high yield via in vitro transcription using plasmids 
such as SP65 (Promega Corporation, Madison, WI). In addition, RNA amplification
methods such as Q-β amplification can be utilized to produce RNAs.
[0095] The nucleic acid molecules can be modified at the base moiety, sugar moiety, 
or phosphate backbone, for example, to improve stability of the molecule, hybridization, 
transport into the cell, etc. For example, modification of a PTM to reduce the overall charge 
can enhance the cellular uptake of the molecule. In addition modifications can be made to 
reduce susceptibility to nuclease degradation. The nucleic acid molecules may include other 
appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents 
facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl.
Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652;
PCT Publication No. WO88/09810, published December 15, 1988) or the blood-brain barrier
(see, e.g., PCT Publication No. WO89/10134, published April 25, 1988), hybridization-
triggered cleavage agents (see, e.g., Krol et al., 1988, BioTechniques 6:958-976) or
intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the nucleic
acid molecules may be conjugated to another molecule, e.g., a peptide, hybridization-
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.
Various other well-known modifications to the nucleic acid molecules can be introduced as
a means of increasing intracellular stability and half-life. Possible modifications include, but
are not limited to, the addition of flanking sequences of ribo- or deoxy-nucleotides to the 5'
and/or 3' ends of the molecule. In some circumstances where increased stability is desired,
nucleic acids having modified internucleoside linkages such as 2'-O-methylation may be
preferred. Nucleic acids containing modified internucleoside linkages may be synthesized
using reagents and methods that are well known in the art (see, Uhlmann et al., 1990, Chem.
Rev. 90:543-584; Schneider et al., 1990, Tetrahedron Lett. 31:335 and references cited
therein).

[0096] The nucleic acids may be purified by any suitable means, as are well known in 
the art. For example, the nucleic acids can be purified by reverse phase chromatography or 
gel electrophoresis. Of course, the skilled artisan will recognize that the method of 
purification will depend in part on the size of the nucleic acid to be purified. 
[0097] In instances where a nucleic acid molecule encoding a PTM is utilized, cloning 
techniques known in the art may be used for cloning of the nucleic acid molecule into an 
expression vector. Methods commonly known in the art of recombinant DNA technology 
which can be used are described in Ausubel et al. (eds.), 1993, Current Protocols in
Molecular Biology, John Wiley & Sons, NY; and Kriegler, 1990, Gene Transfer and
Expression, A Laboratory Manual, Stockton Press, NY.

[0098] The DNA encoding the PTM of interest may be recombinantly engineered into 
a variety of host vector systems that also provide for replication of the DNA in large scale 
and contain the necessary elements for directing the transcription of the PTM. The use of 
such a construct to transfect target cells in the patient will result in the transcription of 
sufficient amounts of PTMs that will form complementary base pairs with the endogenously 
expressed pre-mRNA targets and thereby facilitate a trans-splicing reaction between the 
complexed nucleic acid molecules. For example, a vector can be introduced in vivo such 
that it is taken up by a cell and directs the transcription of the PTM molecule. Such a vector 
can remain episomal or become chromosomally integrated, as long as it can be transcribed 
to produce the desired RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art. 

[0099] Vectors encoding the PTM of interest can be plasmid, viral, or others known 
in the art, used for replication and expression in mammalian cells. Expression of the 
sequence encoding the PTM can be regulated by any promoter known in the art to act in 
mammalian, preferably human cells. Such promoters can be inducible or constitutive. Such 
promoters include but are not limited to: the SV40 early promoter region (Benoist, C. and
Chambon, P. 1981, Nature 290:304-310), the promoter contained in the 3' long terminal
repeat of Rous sarcoma virus (Yamamoto et al., 1980, Cell 22:787-797), the herpes
thymidine kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-
1445), the regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature
296:39-42), the viral CMV promoter, the human chorionic gonadotropin-β promoter
(Hollenberg et al., 1994, Mol. Cell. Endocrinology 106:111-119), etc. Any type of plasmid,
cosmid, YAC or viral vector can be used to prepare the recombinant DNA construct which
can be introduced directly into the tissue site. Alternatively, viral vectors can be used which
selectively infect the desired target cell.

[00100] For use of PTMs encoding peptide affinity purification tags, it is desirable to 
insert nucleotide sequences containing random target binding sites into the PTMs and clone 
them into a selectable mammalian expression vector system. A number of selection systems 
can be used, including but not limited to selection for expression of the herpes simplex virus 
thymidine kinase, hypoxanthine-guanine phosphoribosyltransferase and adenine
phosphoribosyltransferase protein in tk-, hgprt- or aprt-deficient cells, respectively. Also,
antimetabolite resistance can be used as the basis of selection for dihydrofolate reductase
(dhfr), which confers resistance to methotrexate; xanthine-guanine phosphoribosyl
transferase (gpt), which confers resistance to mycophenolic acid; neomycin (neo), which
confers resistance to aminoglycoside G-418; and hygromycin B phosphotransferase (hygro)
which confers resistance to hygromycin. In a preferred embodiment of the invention, the cell
culture is transformed at a low ratio of vector to cell such that there will be only a single
vector, or a limited number of vectors, present in any one cell. Vectors for use in the practice
of the invention include any eukaryotic expression vectors, including but not limited to viral
expression vectors such as those derived from the class of retroviruses or adeno-associated
viruses.

5.3. USES AND ADMINISTRATION OF TRANS-SPLICING MOLECULES

5.3.1. USE OF PTM MOLECULES FOR GENE REGULATION, GENE
REPAIR AND TARGETED CELL DEATH

[00101] The compositions and methods of the present invention will have a variety of
different applications including gene regulation, gene repair and targeted cell death. For
example, trans-splicing can be used to introduce a protein with toxic properties into a cell.
In addition, PTMs can be engineered to bind to viral mRNA and destroy the function of the
viral mRNA, or alternatively, to destroy any cell expressing the viral mRNA. In yet another
embodiment of the invention, PTMs can be engineered to place a stop codon in a deleterious
mRNA transcript thereby decreasing the expression of that transcript.

[00102] Targeted trans-splicing, including double-trans-splicing reactions, can be used
to repair or correct transcripts that are either truncated or contain point mutations. The
PTMs of the invention are designed to cleave a targeted transcript upstream or downstream
of a specific mutation or upstream of a premature 3' and correct the mutant transcript via a
trans-splicing reaction which replaces the portion of the transcript containing the mutation
with a functional sequence.

[00103] In addition, double trans-splicing reactions may be used for the selective
expression of a toxin in tumor cells. For example, PTMs can be designed to replace the
second exon of the human β-chorionic gonadotropin 6 (βhCG6) gene transcripts and to deliver
an exon encoding the subunit A of diphtheria toxin (DT-A). Expression of DT-A in the
absence of subunit B should lead to toxicity only in the cells expressing the gene. βhCG6 is
a prototypical target for genetic modification by trans-splicing. The sequence and the
structure of the βhCG6 gene are completely known and the pattern of splicing has been
determined. The βhCG6 gene is highly expressed in many types of solid tumors, including
many non-germ line tumors, but the βhCG6 gene is silent in the majority of cells in a normal
adult. Therefore, the βhCG6 pre-mRNA represents a desirable target for a trans-splicing
reaction designed to produce tumor-specific toxicity.

[00104] The first exon of βhCG6 pre-mRNA is ideal in that it encodes only five amino
acids, including the initiator AUG, which should result in minimal interference with the
proper folding of the DT-A toxin while providing the required signals for effective
translation of the trans-spliced mRNA. The DT-A exon, which is designed to include a stop
codon to prevent chimeric protein formation, will be engineered to trans-splice into the last
exon of the βhCG6 gene. The last exon of the βhCG6 gene provides the construct with the
appropriate signals to polyadenylate the mRNA and ensure translation.

[00105] Cystic fibrosis (CF) is one of the most common fatal genetic diseases in humans.
Based on both genetic and molecular analyses, the gene associated with cystic fibrosis has
been isolated and its protein product deduced (Kerem, B.S. et al., 1989, Science 245:1073-
1080; Riordan et al., 1989, Science 245:1066-1073; Rommens, et al., 1989, Science
245:1059-1065). The protein product of the CF associated gene is called the cystic fibrosis
transmembrane conductance regulator (CFTR). In a specific embodiment of the invention,
a trans-splicing reaction will be used to correct a genetic defect in the DNA sequence
encoding the cystic fibrosis transmembrane regulator (CFTR) whereby the DNA sequence
encoding the cystic fibrosis trans-membrane regulator protein is expressed and a functional
chloride ion channel is produced in the airway epithelial cells of a patient.

[00106] Population studies have indicated that the most common cystic fibrosis
mutation is a deletion of the three nucleotides in exon 10 that encode phenylalanine at
position 508 of the CFTR amino acid sequence. As indicated in Figure 15, a trans-splicing
reaction was capable of correcting the deletion at position 508 in the CFTR amino acid
sequence. The PTM used for correction of the genetic defect contained a CFTR BD intron
9 sequence, a spacer sequence, a branch point, a polypyrimidine tract, a 3' splice site and a
wild type CFTR BD exon 10 sequence (Figure 13). The successful correction of the mutated
DNA encoding CFTR utilizing a trans-splicing reaction supports the general application of
PTMs for correction of genetic defects.
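
The effect of the three-nucleotide deletion, and of replacing the affected exon by trans-splicing, can be illustrated with a toy model. The Python sketch below is an assumption-laden illustration: the 15-nucleotide fragment is the commonly reported coding sequence around codon 508 (the deletion removes CTT spanning codons 507 and 508, which leaves isoleucine 507 intact and removes phenylalanine 508), the upstream sequence is a hypothetical placeholder, and trans-splicing is modelled simply as joining unchanged upstream sequence to the wild-type exon sequence carried by the PTM.

```python
# Minimal codon table covering only the codons used in this example.
CODONS = {"ATC": "I", "ATT": "I", "TTT": "F", "GGT": "G", "GTT": "V"}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding fragment with the mini codon table."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Coding fragment around codon 508 (Ile-Ile-Phe-Gly-Val); assumed for
# illustration, not reproduced from this specification.
WILD_TYPE_EXON = "ATCATCTTTGGTGTT"

# Delete the three nucleotides CTT (0-based positions 5-7).
MUTANT_EXON = WILD_TYPE_EXON[:5] + WILD_TYPE_EXON[8:]

# Hypothetical placeholder for in-frame sequence upstream of the splice site.
UPSTREAM = "GGTGTT"


def trans_splice(upstream: str, ptm_exon: str) -> str:
    """Toy 3' exon replacement: keep the upstream part, append the PTM exon."""
    return upstream + ptm_exon


mutant_mrna = UPSTREAM + MUTANT_EXON
repaired_mrna = trans_splice(UPSTREAM, WILD_TYPE_EXON)

print(translate(WILD_TYPE_EXON))  # IIFGV
print(translate(MUTANT_EXON))     # IIGV
print(translate(mutant_mrna))     # GVIIGV
print(translate(repaired_mrna))   # GVIIFGV
```

The repaired product regains the phenylalanine because the downstream sequence comes from the PTM rather than from the mutant transcript.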

[00107] Hemophilia A is an X-linked bleeding disorder characterized by a deficiency in
the activity of factor VIII, an important component of the coagulation cascade. The
incidence of hemophilia A is approximately 1 in 5,000 to 10,000 males. Affected individuals 
suffer joint and muscle hemorrhage, easy bruising, and prolonged bleeding from wounds. 
Hemophilia A arises from a variety of mutations within the factor VIII gene. The gene 
comprises 26 exons and spans 186 kb. About 95 percent of those patients with hemophilia 
A in whom mutations have been characterized, have point mutations in the gene. In a 
specific embodiment of the invention, a trans-splicing reaction will be used to correct a 
genetic defect in the DNA sequence encoding factor VIII whereby the DNA sequence 
encoding the factor VIII protein is expressed and a functional clotting factor is produced in 
the plasma of a patient. The PTMs of the invention can be genetically engineered to repair 
any exon of interest, or combination of exons for the purpose of correcting a defect in the 
coding region of the factor VIII gene. 

[00108] Genetic studies have indicated that the most common factor VIII mutation(s)
are be generated. As indicated in Figure 46, a trans-splicing reaction was capable of
correcting the mutation in the factor VIII amino acid sequence. The mutation was created by
an insertion of the neomycin gene into exon 16 and intron 16 of the mouse gene, interrupting
the open reading frame of exon 16 and eliminating intron 16's 3' splice donor site. The PTM
used for correction of the genetic defect contained factor VIII exons 16-24 coding sequences,
a spacer sequence, a branch point, a polypyrimidine tract, and a 3' acceptor splice site (Figure
44A). The successful correction of the mutated DNA encoding factor VIII utilizing a trans-
splicing reaction further supports the general application of PTMs for correction of genetic
defects.

[00109] The methods and compositions of the invention may also be used to regulate 
gene expression in plants. For example, trans-splicing may be used to place the expression
of any engineered gene under the natural regulation of a chosen target plant gene, thereby
regulating the expression of the engineered gene. Trans-splicing may also be used to prevent
the expression of engineered genes in non-host plants or to convert an endogenous gene
product into a more desirable product.

[00110] In a specific embodiment of the invention trans-splicing may be used to regulate
the expression of the insecticidal gene that produces Bt toxin (Bacillus thuringiensis). For
example, the PTM may be designed to trans-splice into an injury response gene (pre-mRNA)
that is expressed only after an insect bites the plant. Thus, all cells of the plant would carry
the gene for Bt in the PTM, but the cells would only produce Bt when and where an insect
injures the plant. The rest of the plant will make little or no Bt. A PTM could trans-splice
the Bt gene into any chosen gene with a desired pattern of expression. Further, it should be
possible to target a PTM so that no Bt is produced in the edible portion of the plant.
[00111] One advantage associated with the use of PTMs is that the PTM acquires the
native gene control elements of the target gene, thus reducing the time and effort that might
otherwise be spent attempting to identify and reconstitute appropriate regulatory sequences
upstream of an engineered gene. Thus, expression of the PTM regulated gene should occur
only in those plant cells containing the target pre-mRNA. By targeting a gene not expressed
in the edible portion of the plant or in the pollen, trans-splicing can alleviate opposition to
genetically modified plants, as consumers would not be eating the proteins made from
modified genes. The edible portion of such crops should test negative for genetically
modified proteins.

[00112] In addition, a PTM can be targeted to a unique sequence of the host gene that is
not present in other plants. Therefore, even if the gene (DNA) which encodes the PTM
jumps to another species of plant, the PTM gene will not have an appropriate target for trans-
splicing. Thus, trans-splicing offers a "fail-safe" mode for prevention of gene "jumping" to
other plant species: the PTM gene will be expressed only in the engineered host plant, which
contains the appropriate target pre-mRNA. Expression in non-engineered plants would not
be possible.

[00113] Trans-splicing also provides a more efficient way to convert one gene product
into another. For example, trans-splicing ribozymes and chimeric oligos can be incorporated
into corn genomes to modify the ratio of saturated to unsaturated oils. Trans-splicing can
also be used to convert one gene product into another.

[00114] Various delivery systems are known and can be used to transfer the
compositions of the invention into cells, e.g. encapsulation in liposomes, microparticles,
microcapsules, recombinant cells capable of expressing the composition, receptor-mediated 
endocytosis (see, e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), construction of 
a nucleic acid as part of a retroviral or other vector, injection of DNA, electroporation, 
calcium phosphate mediated transfection, etc. 

[00115] The compositions and methods can be used to treat cancer and other serious 
viral infections, autoimmune disorders, and other pathological conditions in which the 
alteration or elimination of a specific cell type would be beneficial. Additionally, the 
compositions and methods may also be used to provide a gene encoding a functional 
biologically active molecule to cells of an individual with an inherited genetic disorder where 
expression of the missing or mutant gene product produces a normal phenotype. 
[00116] In a preferred embodiment, nucleic acids comprising a sequence encoding a 
PTM are administered to promote PTM function, by way of gene delivery and expression 
into a host cell. In this embodiment of the invention, the nucleic acid mediates an effect by 
promoting PTM production. Any of the methods for gene delivery into a host cell available 
in the art can be used according to the present invention. For general reviews of the methods 
of gene delivery see Strauss, M. and Barranger, J. A., 1997, Concepts in Gene Therapy, by 
Walter de Gruyter & Co., Berlin; Goldspiel et al., 1993, Clinical Pharmacy 12:488-505; Wu
and Wu, 1991, Biotherapy 3:87-95; Tolstoshev, 1993, Ann. Rev. Pharmacol. Toxicol.
33:573-596; Mulligan, 1993, Science 260:926-932; and Morgan and Anderson, 1993, Ann.
Rev. Biochem. 62:191-217; 1993, TIBTECH 11(5):155-215. Exemplary methods are
described below.

[00117] Delivery of the nucleic acid into a host cell may be either direct, in which case
the host is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, 
in which case, host cells are first transformed with the nucleic acid in vitro, then transplanted 
into the host. These two approaches are known, respectively, as in vivo or ex vivo gene 
delivery. 

[00118] In a specific embodiment, the nucleic acid is directly administered in vivo, 
where it is expressed to produce the PTM. This can be accomplished by any of numerous 
methods known in the art, e.g., by constructing it as part of an appropriate nucleic acid 
expression vector and administering it so that it becomes intracellular, e.g. by infection using 
a defective or attenuated retroviral or other viral vector (see U.S. Patent No. 4,980,286), or 
by direct injection of naked DNA, or by use of microparticle bombardment (e.g., a gene gun; 
Biolistic, Dupont), or coating with lipids or cell-surface receptors or transfecting agents, 
encapsulation in liposomes, microparticles, or microcapsules, or by administering it in 
linkage to a peptide which is known to enter the nucleus, or by administering it in linkage to
a ligand subject to receptor-mediated endocytosis (see e.g., Wu and Wu, 1987, J. Biol. Chem. 
262:4429-4432). 

[00119] In a specific embodiment, a viral vector that contains the PTM can be used. For
example, a retroviral vector can be utilized that has been modified to delete retroviral
sequences that are not necessary for packaging of the viral genome and integration into host
cell DNA (see Miller et al., 1993, Meth. Enzymol. 217:581-599). Alternatively, adenoviral
or adeno-associated viral vectors can be used for gene delivery to cells or tissues. (See,
Kozarsky and Wilson, 1993, Current Opinion in Genetics and Development 3:499-503 for
a review of adenovirus-based gene delivery).

[00120] Another approach to gene delivery into a cell involves transferring a gene to 
cells in tissue culture by such methods as electroporation, lipofection, calcium phosphate 
mediated transfection, or viral infection. Usually, the method of transfer includes the transfer 
of a selectable marker to the cells. The cells are then placed under selection to isolate those 
cells that have taken up and are expressing the transferred gene. The resulting recombinant 
cells can be delivered to a host by various methods known in the art. In a preferred 
embodiment, the cell used for gene delivery is autologous to the host cell. 
[00121] The present invention also provides for pharmaceutical compositions
comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a
pharmaceutically acceptable carrier. In a specific embodiment, the term "pharmaceutically
acceptable" means approved by a regulatory agency of the Federal or a state government or
listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in
animals, and more particularly in humans. The term "carrier" refers to a diluent, adjuvant,
excipient, or vehicle with which the therapeutic is administered. Examples of suitable
pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences" by E.W.
Martin.

[00122] In specific embodiments, pharmaceutical compositions are administered: (1) 
in diseases or disorders involving an absence or decreased (relative to normal or desired) 
level of an endogenous protein or function, for example, in hosts where the protein is 
lacking, genetically defective, biologically inactive or underactive, or under expressed; or 
(2) in diseases or disorders wherein, in vitro or in vivo, assays indicate the utility of PTMs
that inhibit the function of a particular protein. The activity of the protein encoded for by
the chimeric mRNA resulting from the PTM mediated trans-splicing reaction can be readily
detected, e.g., by obtaining a host tissue sample (e.g., from biopsy tissue) and assaying it in 
vitro for mRNA or protein levels, structure and/or activity of the expressed chimeric mRNA. 
Many methods standard in the art can be thus employed, including but not limited to 
immunoassays to detect and/or visualize the protein encoded for by the chimeric mRNA 
(e.g., Western blot, immunoprecipitation followed by sodium dodecyl sulfate polyacrylamide 
gel electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
formation of chimeric mRNA expression by detecting and/or visualizing the presence of 
chimeric mRNA (e.g., Northern assays, dot blots, in situ hybridization, and Reverse- 
Transcription PCR, etc.), etc. 

[00123] The present invention also provides for pharmaceutical compositions 
comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term "pharmaceutically 
acceptable" means approved by a regulatory agency of the Federal or a state government or 
listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in 
animals, and more particularly in humans. The term "carrier" refers to a diluent, adjuvant, 
excipient, or vehicle with which the therapeutic is administered. Examples of suitable 
pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences" by E.W.
Martin. In a specific embodiment, it may be desirable to administer the pharmaceutical
compositions of the invention locally to the area in need of treatment. This may be achieved
by, for example, and not by way of limitation, local infusion during surgery, topical
application, e.g., in conjunction with a wound dressing after surgery, by injection, by means
of a catheter, by means of a suppository, or by means of an implant, said implant being of
a porous, non-porous, or gelatinous material, including membranes, such as silastic
membranes, or fibers. Other controlled-release drug delivery systems may also be used, such
as nanoparticles, matrices such as controlled-release polymers, and hydrogels.

[00124] The PTM will be administered in amounts which are effective to produce the 
desired effect in the targeted cell. Effective dosages of the PTMs can be determined through 
procedures well known to those in the art which address such parameters as biological half- 
life, bioavailability and toxicity. The amount of the composition of the invention which will 
be effective will depend on the nature of the disease or disorder being treated, and can be 
determined by standard clinical techniques. In addition, in vitro assays may optionally be 
employed to help identify optimal dosage ranges. 

[00125] The present invention also provides a pharmaceutical pack or kit comprising
one or more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Optionally associated with such container(s) can be a notice
in the form prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration.

5.3.2. USE OF PTM MOLECULES FOR EXON TAGGING
[00126] In view of current efforts to sequence and characterize the genomes of humans
and other organisms, there is a need for methods that facilitate such characterization. A
majority of the information currently obtained by genomic mapping and sequencing is
derived from complementary DNA (cDNA) libraries, which are made by reverse 
transcription of mRNA into cDNA. Unfortunately, this process causes the loss of 
information concerning intron sequences and the location of exon/intron boundaries. 
[00127] The present invention encompasses a method for mapping exon-intron 
boundaries in pre-mRNA molecules comprising (i) contacting a pre-trans-splicing molecule 
with a pre-mRNA molecule under conditions in which a portion of the pre-trans-splicing 
molecule is trans-spliced to a portion of the target pre-mRNA to form a chimeric mRNA; (ii) 
amplifying the chimeric mRNA molecule; (iii) selectively purifying the amplified molecule; 
and (iv) determining the nucleotide sequence of the amplified molecule thereby identifying 
the intron-exon boundaries. 

[00128] In an embodiment of the present invention, PTMs can be used in trans-splicing
reactions to locate exon-intron boundaries in pre-mRNA molecules. PTMs for use in
mapping of intron-exon boundaries have structures similar to those described above in
Section 5.1. Specifically, the PTMs contain (i) a target binding domain that is designed to
bind to many pre-mRNAs; (ii) a 3' splice region that includes a branch point, pyrimidine tract
and a 3' splice acceptor site, or a 5' splice donor site; (iii) a spacer region that separates the
mRNA splice site from the target binding domain; and (iv) a tag region that will be trans-
spliced onto a pre-mRNA. Alternatively, the PTMs to be used to locate exon-intron
boundaries may be engineered to contain no target binding domain.
[00129] For purposes of intron-exon mapping, the PTMs are genetically engineered to
contain target binding domains comprising random nucleotide sequences. The random
nucleotide sequences contain at least 15-30 and up to several hundred nucleotides
capable of binding and anchoring a pre-mRNA so that the spliceosome processing machinery
of the nucleus can trans-splice a portion (tag or marker region) of the PTM to a portion of
the pre-mRNA. PTMs containing short target binding domains, or containing inosines, bind
under less stringent conditions to the pre-mRNA molecules. In addition, strong branch point
sequences and pyrimidine tracts serve to increase the non-specificity of PTM trans-splicing.
[00130] The random nucleotide sequences used as target binding domains in the PTM 
molecules can be generated using a variety of different methods, including, but not limited 
to, partial digestion of DNA with restriction endonucleases or mechanical shearing of the 
DNA. The use of such random nucleotide sequences is designed to generate a vast array of 
PTM molecules with different binding activities for each target pre-mRNA expressed in a 
cell. Randomized libraries of oligonucleotides can be synthesized with appropriate
restriction endonuclease recognition sites on each end for cloning into PTM molecules
genetically engineered into plasmid vectors. When the randomized oligonucleotides are
ligated and expressed, a randomized binding library of PTMs is generated.
[00131] In a specific embodiment of the invention, an expression library encoding PTM 
molecules containing target binding domains comprising random nucleotide sequences can 
be generated using a variety of methods which are well known to those of skill in the art. 
Ideally, the library is complex enough to contain PTM molecules capable of interacting with 
each target pre-mRNA expressed in a cell.
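
By way of illustration only, and not by way of limitation, the library complexity discussed above can be approximated with a short Python sketch. It counts the distinct sequences available to a fully random binding domain of a given length and draws a few example domains. The function names and the assumption that the four bases are drawn uniformly are illustrative and are not part of the methods described herein.

```python
import random

BASES = "ACGT"


def library_complexity(length: int) -> int:
    """Number of distinct sequences of `length` nucleotides over four bases."""
    return len(BASES) ** length


def random_binding_domains(length: int, count: int, seed: int = 0) -> list:
    """Draw `count` random binding-domain sequences (hypothetical helper).

    Bases are sampled uniformly; real libraries made by partial digestion
    or shearing would not be uniform.
    """
    rng = random.Random(seed)
    return ["".join(rng.choice(BASES) for _ in range(length)) for _ in range(count)]


if __name__ == "__main__":
    # Lengths taken from the 15-30 nucleotide range mentioned in the text.
    for n in (15, 30):
        print(n, library_complexity(n))
    print(random_binding_domains(18, 3))
```

The count grows as a power of four, so the theoretical sequence space for longer random domains far exceeds the number of clones any practical library can hold.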

[00132] By way of example, Figure 9 is a schematic representation of two forms of
PTMs which can be utilized to map intron-exon boundaries. The PTM on the left is capable
of non-specifically trans-splicing into a pre-mRNA 3' splice site, while the PTM on the right
is capable of trans-splicing into a pre-mRNA 5' splice site. Trans-splicing between the PTM
and the target pre-mRNA results in the production of a chimeric mRNA molecule having a
specific nucleotide sequence "tag" on either the 3' or 5' end of an authentic exon.
[00133] Following selective purification, a DNA sequencing reaction is then performed
using a primer which begins in the tag nucleotide sequence of the PTM and proceeds into the 
sequence of the tagged exon. The sequence immediately following the last nucleotide of the 
tag nucleotide sequence represents an exon boundary. For identification of intron-exon tags, 
the trans-splicing reactions of the invention can be performed either in vitro or in vivo using 
methods well known to those of skill in the art. 
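
The boundary determination described above can be sketched, by way of example only, in Python. The sketch assumes a sequencing read that contains the known tag sequence followed by exon sequence, and returns the sequence immediately following the last nucleotide of the tag. The function name and the example tag are hypothetical and chosen solely for illustration.

```python
from typing import Optional


def exon_sequence_after_tag(read: str, tag: str) -> Optional[str]:
    """Return the part of `read` that follows the first occurrence of `tag`.

    The first nucleotide of the returned string is taken to be the exon
    boundary. Returns None when the tag is not found in the read.
    """
    read, tag = read.upper(), tag.upper()
    start = read.find(tag)
    if start == -1:
        return None
    return read[start + len(tag):]


if __name__ == "__main__":
    # Hypothetical tag and read, for illustration only.
    print(exon_sequence_after_tag("ccTAGTAGACGTTGCA", "TAGTAG"))
```

An exact string match is the simplest possible choice; reads with sequencing errors in the tag would call for an approximate match instead.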



5.3.3. USE OF PTM MOLECULES FOR IDENTIFICATION OF 
PROTEINS EXPRESSED IN A CELL 

[00134] In yet another embodiment of the invention, PTM mediated trans-splicing
reactions can be used to identify previously undetected and unknown proteins expressed in
a cell. This method is especially useful for identification of proteins that cannot be detected
by two-dimensional electrophoresis, or by other methods, due to, inter alia, the small size
of the protein, low concentration of the protein, or failure to detect the protein due to similar
migration patterns with other proteins in two-dimensional electrophoresis.

[00135] The present invention relates to a method for identifying proteins expressed in
a cell comprising (i) contacting a pre-trans-splicing molecule containing a random target
binding domain and a nucleotide sequence encoding a peptide tag with a pre-mRNA
molecule under conditions in which a portion of the pre-trans-splicing molecule is trans-
spliced to a portion of the target pre-mRNA to form a chimeric mRNA encoding a fusion
polypeptide; (ii) affinity purifying the fusion polypeptide or separating it by gel
electrophoresis; and (iii) determining the amino acid sequence of the fusion protein.
[00136] To identify proteins expressed in a cell, the PTMs of the invention are
genetically engineered to contain: (i) a target binding domain comprising randomized
nucleotide sequences; (ii) a 3' splice region that includes a branch point, pyrimidine tract and
a 3' splice acceptor site and/or a 5' splice donor site; (iii) a spacer region that separates the
PTM splice site from the target binding domain; and (iv) nucleotide sequences encoding a
marker or peptide affinity purification tag. Such peptide tags include, but are not limited to,
HIS tags (6 consecutive histidine residues) (Janknecht, et al., 1991, Proc. Natl. Acad. Sci.
USA 88:8972-8976), glutathione-S-transferase (GST) (Smith, D.B. and Johnson K.S., 1988,
Gene 67:31) (Pharmacia) or FLAG (Kodak/IBI) tags (Nisson, J. et al., J. Mol. Recognit.
1996, 5:585-594).

[00137] Trans-splicing reactions using such PTMs result in the generation of chimeric
mRNA molecules encoding fusion proteins comprising protein sequences normally 
expressed in a cell linked to a marker or peptide affinity purification tag. The desired goal 
of such a method is that every protein synthesized in a cell receives a marker or peptide 
affinity tag thereby providing a method for identifying each protein expressed in a cell. 
[00138] In a specific embodiment of the invention, PTM expression libraries encoding 
PTMs having different target binding domains comprising random nucleotide sequences are 
generated. The desired goal is to create a PTM expression library that is complex enough 
to produce a PTM capable of binding to each pre-mRNA expressed in a cell. In a preferred
embodiment, the library is cloned into a mammalian expression vector that results in one,
or at most, a few vectors being present in any one cell.

[00139] To identify the expression of chimeric proteins, host cells are transformed with
the PTM library and plated so that individual colonies containing one PTM vector can be
grown and purified. Single colonies are selected, isolated, and propagated in the appropriate
media and the labeled chimeric protein exon(s) fragments are separated away from other
cellular proteins using, for example, an affinity purification tag. For example, affinity
chromatography can involve the use of antibodies that specifically bind to a peptide tag such
as the FLAG tag. Alternatively, when utilizing HIS tags, the fusion proteins are purified
using Ni2+-nitrilotriacetic acid agarose columns, which allow selective elution of bound
peptide with imidazole-containing buffers. When using GST tags, the fusion proteins
are purified using glutathione agarose beads. The fusion proteins can then be
eluted in the presence of free glutathione.

[00140] Following purification of the chimeric protein, an analysis is carried out to
determine the amino acid sequence of the fusion protein. The amino acid sequence of the
fusion protein is determined using techniques well known to those of skill in the art, such as
Edman degradation followed by amino acid analysis using HPLC, mass spectrometry or an
amino acid analyzer. Once identified, the peptide sequence is compared to those
sequences available in protein databases, such as GenBank. If the partial peptide sequence
is already known, no further analysis is done. If the partial protein sequence is unknown,
then more complete sequencing of that protein can be carried out to determine the full protein
sequence. Since the fusion protein will contain only a portion of the full length protein, a
nucleic acid encoding the full length protein can be isolated using conventional methods.
For example, based on the partial protein sequence, oligonucleotide primers can be generated
for use as probes or PCR primers to screen a cDNA library.
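
By way of example only, the choice of a peptide region from which to design oligonucleotide primers can be sketched in Python. The sketch uses the number of codons that encode each amino acid in the standard genetic code to compute the degeneracy of a back-translated primer, and picks the window of the partial peptide with the fewest possible coding sequences. The function names and the example peptide are hypothetical; the specification does not prescribe any particular primer design procedure.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that could encode `peptide`."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


def least_degenerate_window(peptide: str, width: int) -> tuple:
    """Return (window, degeneracy) for the least degenerate stretch of `width` residues.

    Ties are resolved in favor of the window closest to the N-terminus.
    """
    windows = [peptide[i:i + width] for i in range(len(peptide) - width + 1)]
    best = min(windows, key=primer_degeneracy)
    return best, primer_degeneracy(best)


if __name__ == "__main__":
    # Hypothetical partial peptide sequence, for illustration only.
    print(least_degenerate_window("LSMWKHDL", 6))
```

Windows rich in methionine and tryptophan give the least degenerate primers, because each of those residues is encoded by a single codon.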

6. EXAMPLE: PRODUCTION OF TRANS-SPLICING MOLECULES
[00141] The following section describes the production of PTMs and the demonstration 
that such molecules are capable of mediating trans-splicing reactions resulting in the 
production of chimeric mRNA molecules. 

6.1. MATERIALS AND METHODS 
6.1.1. CONSTRUCTION OF PRE-mRNA MOLECULES 

[00142] Plasmids containing the wild type diphtheria toxin subunit A (DT-A, wild-type
accession #K01722) and a DT-A mutant (CRM 197, no enzymatic activity) were obtained
from Dr. Virginia Johnson, Food and Drug Administration, Bethesda, Maryland (Uchida et
al., 1973, J. Biol. Chem. 248:3838). For in vitro experiments, DT-A was amplified using
primers DT-1F (5'-GGCGCTGCAGGGCGCTGATGATGTTGTTG) and DT-2R
(5'-GGCGAAGCTTGGATCCGACACGATTTCCTGCACAGG), cut with PstI and HindIII,
and cloned into PstI and HindIII digested pBS(-) vector (Stratagene, La Jolla, CA). The
resulting clone, pDTA, was used to construct the individual PTMs. (1) pPTM+: Targeted
construct. Created by inserting IN3-1
(5'-AATTCTCTAGATGCTTCACCCGGGCCTGACTCGAGTACTAACTGGTACCTCTTCTTTTTTTTCCTGCA) and
IN2-4 (5'-GGAAAAAAAAGAAGAGGTACCAGTTAGTACTCGAGTCAGGCCCGGGTGAAGCATCTAGAG)
primers into EcoRI and PstI digested pDTA. (2)
pPTM+Sp: As pPTM+ but with a 30 bp spacer sequence between the BD and BP. Created
by digesting pPTM+ with XhoI and ligating in the oligonucleotides, spacer S
(5'-TCGAGCAACGTTATAATAATGTTC) and spacer AS
(5'-TCGAGAACATTATTATAACGTTGC). For in vivo studies, an EcoRI and HindIII fragment of pcPTM+Sp was
cloned into mammalian expression vector pcDNA3.1 (Invitrogen), under the control of a
CMV promoter. Also, the methionine at codon 14 was changed into isoleucine to prevent
initiation of translation. The resulting plasmid was designated as pcPTM+Sp. (3)
pPTM+CRM: As pPTM+Sp but the wild type DT-A was substituted with CRM mutant DT-
A (T. Uchida, et al., 1973, J. Biol. Chem. 248:3838). This was created by PCR amplification
of a DT-A mutant (mutation at G52E) using primers DT-1F and DT-2R. For in vivo studies,
an EcoRI-HindIII fragment of PTM+CRM was cloned into pc3.1DNA, resulting in
pcPTM+CRM. (4) PTM-: Non-targeted construct. Created by digestion of PTM+ with
EcoRI and PstI, gel purified to remove the binding domain, followed by ligation of the
oligonucleotides IN-5 (5'-ATCTCTAGATCAGGCCCGGGTGAAGCCCGAG) and IN-6
(5'-TGCTTCACCCGGGCCTGATCTAGAG). (5) PTM-Sp is an identical version of the
PTM-, except it has a 30 bp spacer sequence at the PstI site. Similarly, the splice mutants
[Py(-)AG(-) and BP(-)Py(-)AG(-)] and safety variants [PTM+SF-Py1, PTM+SF-Py2,
PTM+SFBP3 and PTM+SFBP3-Py1] were constructed either by insertion or deletion of
specific sequences (see Table 1).

Table 1. Binding/non-binding domain, BP, PPT and 3' ss sequences of different PTMs.

PTM construct           BD/NBD                BP        PPT                3' ss

PTM+Sp (targeted)       TGCTTCACCCGGGCCTGA    TACTAAC   CTCTTCTTTTTTTTCC   CAG
PTM-Sp (non-targeted)   CAACGTTATAATAATGTT    TACTAAC   CTCTTCTTTTTTTTCC   CAG
PTM+Py(-)AG(-)BP(-)     TGCTTCACCCGGGCCTGA    GGCTGAT   CTGTGATTAATAGCGG   ACG
PTM+Py(-)AG(-)          TGCTTCACCCGGGCCTGA    TACTAAC   CCTGGACGCGGAAGTT   ACG
PTM+SF                  CTGGGACAAGGACACTGCTT  TACTAAC                      CAG
                        CACCCGGTTAGTAGACCACA
                        GCCCTGAAGCC
PTM+SF-Py1              As in PTM+SF          TACTAAC   CTTCTGTATTATTCTC   CAG
PTM+SF-Py2              As in PTM+SF          TACTAAC   GTTCTGTCCTTGTCTC   CAG
PTM+SF-BP3              As in PTM+SF          TGCTGAC                      CAG
PTM+SFBP3-Py1           As in PTM+SF          TGCTGAC   CTTCTGTATTATTCTC   CAG


Nucleotides in bold indicate the mutations compared to normal BP, PPT and 3' splice site.
Branch site A is underlined. The nucleotides in italics indicate the mismatch introduced
into safety BD to mask the BP sequence in the PTM.
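
By way of illustration only, the way a pair of synthetic oligonucleotides such as IN3-1 and IN2-4 in paragraph [00142] can anneal to leave single-stranded ends for cloning may be checked with a short Python sketch. The sketch looks for the reverse complement of the bottom-strand oligonucleotide within the top-strand oligonucleotide and reports the unpaired flanks of the top strand. The function names are hypothetical, and the sequences are transcribed from the text above with the line-break spaces removed.

```python
from typing import Optional, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def annealed_duplex(top: str, bottom: str) -> Optional[Tuple[str, str, str]]:
    """Split `top` into (5' flank, paired region, 3' flank).

    The paired region is the reverse complement of `bottom`. Returns None
    if `bottom` cannot pair along its full length with `top`.
    """
    top = top.upper()
    paired = reverse_complement(bottom)
    start = top.find(paired)
    if start == -1:
        return None
    return top[:start], paired, top[start + len(paired):]


if __name__ == "__main__":
    # Oligonucleotides as transcribed from paragraph [00142].
    in3_1 = "AATTCTCTAGATGCTTCACCCGGGCCTGACTCGAGTACTAACTGGTACCTCTTCTTTTTTTTCCTGCA"
    in2_4 = "GGAAAAAAAAGAAGAGGTACCAGTTAGTACTCGAGTCAGGCCCGGGTGAAGCATCTAGAG"
    print(annealed_duplex(in3_1, in2_4))
```

Unpaired flanks on the top strand can then be compared by eye with the single-stranded ends left by the restriction enzymes used to open the vector.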

[00143] A double-trans-splicing PTM construct (DS-PTM) was also made by adding a
5' splice site and a second target binding domain complementary to the second intron of
βHCG pre-mRNA to the 3' end of the toxin coding sequence of PTM+SF (Figure A).

6.1.2. βHCG6 TARGET PRE-mRNA
[00144] To produce the in vitro target pre-mRNA, a SacI fragment of βHCG gene 6
(accession #X00266) was cloned into pBS(-). This produced an 805 bp insert from
nucleotide 460 to 1265, which includes the 5' untranslated region, initiation codon, exon 1,
intron 1, exon 2, and most of intron 2. For in vivo studies, an EcoRI and BamHI fragment
was cloned into mammalian expression vector (pc3.1DNA), producing βHCG6.

6.1.3. mRNA PREPARATION 
[00145] For in vitro splicing experiments, βHCG6, β-globin pre-mRNA and different
PTM mRNAs were synthesized by in vitro transcription of BamHI and HindIII digested
plasmid DNAs respectively, using T7 RNA polymerase (Pasman & Garcia-Blanco, 1996,
Nucleic Acids Res. 24:1638). Synthesized mRNAs were purified by electrophoresis on a
denaturing polyacrylamide gel, and the products were excised and eluted.

6.1.4. IN VITRO SPLICING
[00146] PTMs and target pre-mRNA were annealed by heating at 98 °C followed by
slow cooling to 30-34 °C. Each reaction contained 4 µl of annealed mRNA complex (100
ng of target and 200 ng of PTM), 1X splice buffer (2 mM MgCl2, 1 mM ATP, 5 mM
creatine phosphate, and 40 mM KCl) and 4 µl of HeLa splice nuclear extract (Promega)
in a 12.5 µl final volume. Reactions were incubated at 30°C for the indicated times and
stopped by the addition of an equal volume of high salt buffer (7 M urea, 5% SDS, 100 mM
LiCl, 10 mM EDTA and 10 mM Tris-HCl, pH 7.5). Nucleic acids were purified by extraction
with phenol:chloroform:isoamyl alcohol (50:49:1) followed by ethanol precipitation.

6.1.5. REVERSE TRANSCRIPTION-PCR REACTIONS 
[00147] RT-PCR analysis was performed using EZ-RT PCR kit (Perkin-Elmer, Foster
City, CA). Each reaction contained 10 ng of cis- or trans-spliced mRNA, or 1-2 µg of total
mRNA, 0.1 µl of each 3' and 5' specific primer, 0.3 mM of each dNTP, 1X EZ buffer (50
mM bicine, 115 mM potassium acetate, 4% glycerol, pH 8.2), 2.5 mM magnesium acetate
and 5 U of rTth DNA polymerase in a 50 µl reaction volume. Reverse transcription was
performed at 60°C for 45 min followed by PCR amplification of the resulting cDNA as
follows: one cycle of initial denaturation at 94°C for 30 sec, and 25 cycles of denaturation
at 94°C for 18 sec and annealing and extension at 60°C for 40 sec, followed by a 7 min final
extension at 70°C. Reaction products were separated by electrophoresis in agarose gels.
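
By way of example only, the thermal program described above can be written down as data and its total programmed hold time summed in Python. The step names and the data layout are illustrative assumptions; the times and temperatures are those recited in the preceding paragraph, and ramping time between temperatures is not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float
    seconds: int


# Hold times and temperatures as recited for the RT-PCR reaction.
REVERSE_TRANSCRIPTION = Step("reverse transcription", 60, 45 * 60)
INITIAL_DENATURATION = Step("initial denaturation", 94, 30)
CYCLE = [Step("denaturation", 94, 18), Step("annealing and extension", 60, 40)]
CYCLES = 25
FINAL_EXTENSION = Step("final extension", 70, 7 * 60)


def total_hold_seconds() -> int:
    """Sum of programmed hold times; ramping between temperatures is ignored."""
    cycling = CYCLES * sum(step.seconds for step in CYCLE)
    return (REVERSE_TRANSCRIPTION.seconds + INITIAL_DENATURATION.seconds
            + cycling + FINAL_EXTENSION.seconds)


if __name__ == "__main__":
    total = total_hold_seconds()
    print(f"{total} s ({total / 60:.1f} min)")
```

Keeping the program as data rather than prose makes it straightforward to compare runs that differ only in cycle number or hold time.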

Primers used in the study were as follows: 
DT-1F: GGCGCTGCAGGGCGCTGATGATGTTGTTG 
DT-2R: GGCGAAGCTTGGATCCGACACGATTTCCTGCACAGG 
DT-3R: CATCGTCATAATTTCCTTGTG 
DT-4R: ATGGAATCTACATAACCAGG 
DT-5R: GAAGGCTGAGCACTACACGC 
HCG-R2: CGGCACCGTGGCCGAAGTGG
Bio-HCG-F: ACCGGAATTCATGAAGCCAGGTACACCAGG
β-globulin-F: GGGCAAGGTGAACGTGGATG
β-globulin-R: ATCAGGAGTGGACAGATCC

6.1.6. CELL GROWTH, TRANSFECTION AND mRNA ISOLATION
[00148] Human lung cancer cell line H1299 (ATCC accession # CRL-5803) was grown
in RPMI medium supplemented with 10% fetal bovine serum at 37°C in a 5% CO2
environment. Cells were transfected with pcSp+CRM (CRM is a non-functional toxin), a
vector expressing a PTM, or vector alone (pcDNA3.1) using lipofectamine reagent (Life
Technologies, Gaithersburg, MD). The assay was scored for neomycin resistance (neoR)
colony formation two weeks after transfection. Four neoR colonies were selected and
expanded under continued neo selection. Total cellular mRNA was isolated using RNA exol
(BioChain Institute, Inc., San Leandro, CA) and used for RT-PCR.

6.1.7. TRANS-SPLICING IN TUMORS IN NUDE MICE
[00149] Eleven nude mice were bilaterally injected (except B10, B11 and B12, which had
1 tumor) into the dorsal flank subcutaneous space with 1 x 10^7 H1299 human lung tumor
cells (day 1). On day 14, the mice were given an appropriate dose of anesthesia and injected
with, or without electroporation (T820, BTX Inc., San Diego, CA) in several orientations
with a total volume of 100 µl of saline containing 100 µg pcSp+CRM with or without
pcβHCG6 or pcPTM+Sp. Solutions injected into the right side tumors also contained India
ink to mark needle tracks. The animals were sacrificed 48 hours later and the tumor excised
and immediately frozen at -80°C. For analysis, 10 mg of each tumor was homogenized and
mRNA was isolated using a Dynabeads mRNA direct kit (Dynal) following the
manufacturer's directions. Purified mRNA (2 µl of 10 µl total volume) was subjected to RT-
PCR using βHCG-F and DT-5R primers as described earlier. All samples were re-amplified
using DT-3R, a nested DT-A primer and biotinylated βHCG-F and the products were
analyzed by electrophoresis on a 2% agarose gel. Samples that produced a band were
processed into single stranded DNA using M280 Streptavidin Dynabeads and sequenced
using a toxin specific primer (DT-3R).

6.2. RESULTS
6.2.1. SYNTHESIS OF PTM
[00150] A prototypical trans-splicing mRNA molecule, pcPTM+Sp (Figure 1A), was
constructed that included: an 18 nt target binding domain (complementary to βHCG6 intron
1), a 30 nucleotide spacer region, branch point (BP) sequence, a polypyrimidine tract (PPT)
and an AG dinucleotide at the 3' splice site immediately upstream of an exon encoding
diphtheria toxin subunit A (DT-A) (Uchida et al., 1973, J. Biol. Chem. 248:3838). Later
DT-A exons were modified to eliminate translation initiation sites at codon 14. The PTM
constructs were designed for maximal activity in order to demonstrate trans-splicing;
therefore, they included potent 3' splice elements (yeast BP and a mammalian PPT) (Moore
et al., 1993, In The RNA World, R.F. Gesteland and J.F. Atkins, eds. (Cold Spring Harbor,
New York: Cold Spring Harbor Laboratory Press)). βHCG6 pre-mRNA (Talmadge et al.,
1984, Nucleic Acids Res. 12:8415) was chosen as a model target as this gene is expressed
in most tumor cells. It is not expressed in normal adult cells, with the exception of some in
the pituitary gland and gonads (Acevedo et al., 1992, Cancer 76:1467; Hoon et al., 1996,
Int. J. Cancer 69:369; Bellet et al., 1997, Cancer Res. 57:516). As shown in Figure 1C,
pcPTM+Sp forms conventional Watson-Crick base pairs by its binding domain with the 3'
end of βHCG6 intron 1, masking the intronic 3' splice signals of the target. This feature is
designed to facilitate trans-splicing between the target and the PTM.
[00151] HeLa nuclear extracts were used in conjunction with established splicing
procedures (Pasman & Garcia-Blanco, 1996, Nucleic Acids Res. 24:1638) to test if a PTM
construct could invade the βHCG6 pre-mRNA target. The products of in vitro trans-splicing
were detected by RT-PCR, using primers specific for chimeric mRNA molecules. The
predicted product of a successful trans-splicing reaction is a chimeric mRNA comprising the
first exon of βHCG6, followed immediately by the exon contributed from pcPTM+Sp
encoding DT-A (Figure 1C). Such chimeric mRNAs were readily detected by RT-PCR
using primers βHCG-F (specific to βHCG6 exon 1) and DT-3R (specific to DT-A; Figure
2A, lanes 1-2). At time zero or in the absence of ATP, no 466 bp product was observed,
indicating that this reaction was both ATP and time dependent.

[00152] The target binding domain of pcPTM+Sp contained 18 nucleotides
complementary to βHCG6 intron 1 pre-mRNA and demonstrated efficient trans-splicing
(Figure 2A, lanes 1-2). Trans-splicing efficiency decreased at least 8 fold (Figure 2, lanes 3-
4) using non-targeted PTM-Sp, which contains a non-complementary 18 nucleotide "non-
binding domain". Trans-splicing efficiencies of PTM mRNAs with or without a spacer
between the binding domain and BP were also compared. This experiment demonstrated a
significant increase in the efficiency of trans-splicing by the addition of a spacer (Figure 2B,
lanes 2 + 5). To facilitate the recruitment of splicing factors required for efficient trans-
splicing, some space may be needed between the 3' splice site and the double-stranded
secondary structure produced by the binding domain/target interaction.
[00153] To investigate the effect of PTM length on trans-splicing specificity, shorter
PTMs were synthesized from AccI cut PTM plasmid (see Figure 1). This eliminated 479 nt
from the 3' end of the DT-A coding sequence. Figure 2B shows the trans-splicing ability of
a targeted short PTM(+) (lanes 10-12), compared to a non-targeted short PTM(-) (lanes 14-
17). Short PTM(+) produced substantially more trans-spliced product (Figure 2B, lane 12)
than its counterpart, non-targeted short PTM(-) (Figure 2B, lane 17). These experiments
indicate that longer PTMs may have increased potential to mediate trans-splicing non-
specifically.

6.2.2. ACCURACY OF PTM SPLICEOSOME MEDIATED
TRANS-SPLICING

[00154] To confirm that trans-splicing between the pcPTM+Sp and βHCG6 target is
precise, RT-PCR amplified product was produced using 5' biotinylated βHCG-F and non-
biotinylated DT-3R primers. This product was converted into single stranded DNA and
sequenced directly with primer DT-3R (DT-A specific reverse primer) using the method of
Mitchell and Merril (1989, Anal. Biochem. 178:239). Trans-splicing occurred exactly
between the predicted splice sites (Figure 3), confirming that a conventional pre-mRNA can
be invaded by an engineered PTM construct during splicing; moreover, this reaction is
precise.

[00155] In addition, selective trans-splicing of a double splicing PTM (DS-PTM) was
observed (Figure 8B). The DS-PTM can produce trans-splicing by contributing either a 3'
or 5' splice site. Further, DS-PTMs can be constructed which will be capable of
simultaneously double-trans-splicing, at both a 3' and 5' site, thereby permitting exon
replacement. Figure 8B demonstrates that in this construct the 5' splice site is most active
at a 1:1 concentration of target βHCG pre-mRNA:DS-PTM. At a 1:6 ratio the 3' splice site
is more active.

6.2.3. 3' SPLICE SITES ARE ESSENTIAL FOR PTM TRANS-SPLICING
[00156] In general, the 3' splice site contains three elements: 1) a BP sequence located
5' of the acceptor site, 2) a PPT consisting of a short run of pyrimidine residues, and 3) a
YAG trinucleotide splice site acceptor at the intron-exon border (Senapathy et al., 1990, Cell
91:875; Moore et al., 1993). Deletion or alteration of one of these sequence elements is
known to either decrease or abolish splicing (Aebi et al., 1986; Reed & Maniatis, 1988,
Genes Dev. 2:1268; Reed, 1989, Genes Dev. 3:2113; Roscigno et al., 1993, J. Biol. Chem.
268:11222; Coolidge et al., 1997, Nucleic Acids Res. 25:888). The role of these conserved
elements in targeted trans-splicing was addressed experimentally. In one case [BP(-)Py(-)AG(-)],
all three cis elements (BP, PPT and AG dinucleotide) were replaced by random
sequences. A second splicing mutant [Py(-)AG(-)] was constructed in which the PPT and
the 3' splice site acceptor were mutated and substituted by random sequences. Neither
construct was able to support trans-splicing in vitro (Figure 2A, lanes 5-8), suggesting that,
as in the case of conventional cis-splicing, the PTM trans-splicing process also requires a
functional BP, PPT and AG acceptor at the 3' splice site.
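
The three-element requirement described above can be illustrated with a short Python sketch. It is not part of the reported methods. It scans a DNA-form sequence for a match to the YNYURAC branch point consensus cited in this specification, for the longest uninterrupted pyrimidine run, and for a terminal AG. The function name `scan_3ss`, the regular expressions, and the use of the longest pyrimidine run as a stand-in for the PPT are assumptions made purely for illustration; the wild-type input is the branch point/PPT/acceptor region listed for DSPTM7 later in this specification, and the purine-only "mutant" is hypothetical.

```python
import re

# YNYURAC written for DNA (U -> T): Y = C/T, N = any base, R = A/G.
BP_CONSENSUS = re.compile(r"[CT][ACGT][CT]T[AG]AC")
PYRIMIDINE_RUN = re.compile(r"[CT]+")


def scan_3ss(seq):
    """Report the three 3' splice site elements found in a sequence.

    Illustrative helper (hypothetical name): returns the first branch
    point consensus match, the length of the longest pyrimidine run,
    and whether the sequence ends with the AG acceptor dinucleotide.
    """
    seq = seq.upper().replace(" ", "").replace("U", "T")
    bp = BP_CONSENSUS.search(seq)
    runs = PYRIMIDINE_RUN.findall(seq)
    longest = max(runs, key=len) if runs else ""
    return {
        "branch_point": bp.group(0) if bp else None,
        "ppt_length": len(longest),
        "ends_with_AG": seq.endswith("AG"),
    }


# BP / KpnI / PPT / EcoRV / acceptor region as listed for DSPTM7.
wild_type = "TACTAAC T GGTACC TCTTCTTTTTTTTTT GATATC CTGCAG"
# Hypothetical stand-in for a mutant whose elements were replaced.
mutant = "GAGAGAGAGAGA"

print(scan_3ss(wild_type))
print(scan_3ss(mutant))
```

A scan of this kind only checks for the presence of sequence motifs; it says nothing about splicing efficiency, which in the experiments above was measured by RT-PCR.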

6.2.4. DEVELOPMENT OF A "SAFETY" SPLICE SITE
TO INCREASE SPECIFICITY

[00157] To improve the levels of target specificity achieved by the inclusion of a
binding domain or by shortening the PTM, the target-binding domain of several PTM
constructs was modified to create an intra-molecular stem to mask the 3' splice site (termed
a "safety PTM"). The safety stem is formed by portions of the binding domain that partially
base pair with regions of the PTM 3' splice site or sequences adjacent to them, thereby
blocking the access of spliceosomal components to the PTM 3' splice site prior to target
acquisition (Figure 4A, PTM+SF). Base pairing between free portions of the PTM binding
domain and the βHCG6 target region unwinds the safety stem, allowing splicing factors such
as U2AF to bind to the PTM 3' splice site and initiate trans-splicing (Figure 4B).
[00158] This concept was tested in splicing reactions containing either PTM+SF (safety)
or pcPTM+Sp (linear), and both target (βHCG6) and non-target (β-globin) pre-mRNA. The
spliced products were subsequently analyzed by RT-PCR and gel electrophoresis. Using
βHCG-F and DT-3R primers, the specific 196 bp trans-spliced band was demonstrated in
reactions containing βHCG target and either linear PTM (pcPTM+Sp, Figure 5, lane 2) or
safety PTM (PTM+SF, Figure 5, lane 8). Comparison of the targeted trans-splicing between
linear PTM (Figure 5, lane 2) and safety PTM (Figure 5, lane 8) demonstrated that the safety
PTM trans-spliced less efficiently than the linear PTM.

[00159] Non-targeted reactions were amplified using β-globin-F (specific to exon 1 of
β-globin) and DT-3R primers. The predicted product generated by non-specific PTM trans-
splicing with β-globin pre-mRNA is 189 bp. Non-specific trans-splicing was evident
between linear PTM and β-globin pre-mRNA (Figure 5, lane 5). In contrast, non-specific
trans-splicing was virtually eliminated by the use of safety PTM (Figure 5, lane 11). This
was not unexpected, since the linear PTM was designed for maximal activity to prove the
concept of spliceosome-mediated trans-splicing. The open structure of the linear PTM
combined with its potent 3' splice sites strongly promotes the binding of splicing factors.
Once bound, these splicing factors can potentially initiate trans-splicing with any 5' splice
site, in a process similar to trans-splicing in trypanosomes. The safety stem was designed
to prevent splicing factors, such as U2AF, from binding to the PTM prior to target
acquisition. This result is consistent with a model in which base-pairing between the free portion
of the binding domain and the βHCG6 target unwinds the safety stem (by mRNA-mRNA
interaction), uncovering the 3' splice site and permitting the recruitment of splicing factors and
initiation of trans-splicing. No trans-splicing was detected between β-globin and βHCG6
pre-mRNAs (Figure 5, lanes 3, 6, 9 and 12).

6.2.5. IN VITRO TRANS-SPLICING OF SAFETY PTM AND VARIANTS
[00160] To better understand the role of cis-elements at the 3' splice site in trans-
splicing, a series of safety PTM variants were constructed in which either the PPT was
weakened by substitution with purines and/or the BP was modified by base substitution (see
Table I). In vitro trans-splicing efficiency of the safety (PTM+SF) was compared to three
safety variants, which demonstrated a decreased ability to trans-splice. The greatest effect
was observed with variant 2 (PTM+SFPy2), which was trans-splicing incompetent (Figure
4C, lanes 5-6). This inhibition of trans-splicing may be attributed to a weakened PPT and/or
the higher Tm of the safety stem. In contrast, variations in the BP sequence (PTM+SFBP3)
did not markedly affect trans-splicing (Figure 4C, lanes 7-8). This was not surprising since
the modifications introduced were within the mammalian branch point consensus range
YNYURAC (where Y = pyrimidine, R = purine and N = any nucleotide) (Moore et al.,
1993). This finding indicates that the branch point sequence can be varied within this
consensus without markedly affecting splicing efficiency. Alterations in the PPT (PTM+SF-Py1) decreased the level of
trans-splicing (lanes 3-4). Similarly, when both BP and PPT were altered (PTM+SFBP3-Py1),
trans-splicing was further reduced (Figure 4C, lanes 9-10). The order of trans-
splicing efficiency of these safety variants is PTM+SF>PTM+SFBP3>
PTM+SFPy1>PTM+SFBP3-Py1>PTM+SFPy2. These results confirm that both the PPT and
BP are important for efficient in vitro trans-splicing (Roscigno et al., 1993, J. Biol. Chem.
268:11222).

6.2.6. COMPETITION BETWEEN CIS- AND TRANS-SPLICING
[00161] To determine if it was possible to block pre-mRNA cis-splicing by increasing
concentrations of PTM, experiments were performed to drive the reaction towards trans-
splicing. Splicing reactions were conducted with a constant amount of βHCG6 pre-mRNA
target and various concentrations of trans-splicing PTM. Cis-splicing was monitored by RT-
PCR using primers βHCG-F (exon 1) and βHCG-R2 (exon 2). This amplified the
expected 125 bp cis-spliced and 478 bp unspliced products (Figure 6A). The primers βHCG-
F and DT-3R were used to detect trans-spliced products (Figure 6B). At lower
concentrations of PTM, cis-splicing (Figure 6A, lanes 1-4) predominated over trans-splicing
(Figure 6B, lanes 1-4). Cis-splicing was reduced by approximately 50% at a PTM
concentration 1.5 fold greater than target. Increasing the PTM mRNA concentration to 3
fold that of target inhibited cis-splicing by more than 90% (Figure 6A, lanes 7-9), with a
concomitant increase in the trans-spliced product (Figure 6B, lanes 6-10). A competitive
RT-PCR was performed to simultaneously amplify both cis- and trans-spliced products by
including all three primers (βHCG-F, βHCG-R2 and DT-3R) in a single reaction. This
experiment had similar results to those seen in Figure 6, demonstrating that under in vitro
conditions, a PTM can effectively block target pre-mRNA cis-splicing and replace it with
the production of an engineered trans-spliced chimeric mRNA.

6.2.7. TRANS-SPLICING IN TISSUE CULTURE
[00162] To demonstrate the mechanism of trans-splicing in a cell culture model, the
human lung cancer line H1299 (βHCG6 positive) was transfected with a vector expressing
SP+CRM (a non-functional diphtheria toxin) or vector alone (pcDNA3.1) and grown in the
presence of neomycin. Four neomycin resistant colonies were individually collected after
14 days and expanded in the continued presence of neomycin. Total mRNA was isolated
from each clone and analyzed by RT-PCR using primers βHCG-F and DT-3R. This yielded
the predicted 196 bp trans-spliced product in three out of the four selected clones (Figure 7A,
lanes 2, 3 and 4). The amplified product from clone #2 was directly sequenced, confirming
that PTM driven trans-splicing occurred in human cells exactly at the predicted splice sites
of endogenously expressed βHCG6 target exon 1 and the first nucleotide of DT-A (Figure
7B).

6.2.8. TRANS-SPLICING IN AN IN VIVO MODEL
[00163] To demonstrate the mechanism of trans-splicing in vivo, the following
experiment was conducted in athymic (nude) mice. Tumors were established by injecting
10^7 H1299 cells into the dorsal flank subcutaneous space. On day 14, PTM expression
plasmids were injected into tumors. Most tumors were then subjected to electroporation to
facilitate plasmid delivery (see Table 2, below). After 48 hrs, tumors were removed, and poly-A
mRNA was isolated and amplified by RT-PCR. Trans-splicing was detected in 8 out of 19
PTM treated tumors. Two samples produced the predicted trans-spliced product (466 bp)
from mRNA after one round of RT-PCR. Six additional tumors were subsequently positive
for trans-splicing by a second PCR amplification using a nested set of primers that produced
the predicted 196 bp product (Table 2). Each positive sample was sequenced, demonstrating
that βHCG6 exon 1 was precisely trans-spliced to the coding sequence of DT-A (wild type
or CRM mutant) at the predicted splice sites. Six of the positive samples were from
treatment groups that received cotransfected plasmids, pcPTM+CRM and pcHCG6, which
increased the concentration of target pre-mRNA. This was done to enhance the probability
of detecting trans-spliced events. The other two positive tumors were from a group that
received only pcPTM+Sp (wild type DT-A). These tumors were not transfected with
βHCG6 expression plasmid, demonstrating once again, as in the tissue culture model
described in Section 6.2.7, that trans-splicing occurred between the PTM and endogenous
βHCG6 pre-mRNA produced by tumor cells.

Table 2. Trans-splicing in tumors in nude mice.

Mouse  Plasmid           Tumor          Electroporation  RT-PCR       Nested PCR  Nucleotide Sequence
                         Left   Right                    Left  Right
------------------------------------------------------------------------------------------------------------------
B1     pCMV-Sport
B2     pCMV-Sport        B1-3   B1-4    1000 V/cm (a)    -            -           -
B3     pcSp+CRM          B3-1   B3-2    1000 V/cm (a)    -            -           -
                         B3-3   B3-4    1000 V/cm (a)
B4     pcSp+CRM          B4-1   B4-2    50 V/cm (b)
                         B4-3   B4-4    25 V/cm (c)      -            -           -
B5     pcSp+CRM/pcHCG6   B5-1   B5-2    1000 V/cm (a)    +     +                  ATGTTCCAG | GGCGTGATGAT (SEQ ID NO:53)
                         B5-3   B5-4    1000 V/cm (a)                 +  +        ATGTTCCAG | GGCGTGATGAT (SEQ ID NO:53)
B6     pcSp+CRM/pcHCG6   B6-1   B6-2    50 V/cm (b)
                         B6-3   B6-4    25 V/cm (c)                   +  +        ATGTTCCAG | GGCGTGATGAT (SEQ ID NO:53)
B7     pcPTM+Sp          B7-1           1000 V/cm (a)
B8     pcPTM+Sp          B8-1           50 V/cm (b)                   +           ATGTTCCAG | GGCGTGATGAT (SEQ ID NO:53)
B9     pcPTM+Sp          B9-1           none                          +           ATGTTCCAG | GGCGTGATGAT (SEQ ID NO:53)

(a): 6 pulses of 99 µs, in sets of 3 pulses administered orthogonally
(b): 8 pulses of 10 ms, in sets of 4 pulses administered orthogonally
(c): 8 pulses of 50 ms, in sets of 4 pulses administered orthogonally
+ : positive for RT-PCR trans-spliced product
none: did not receive electroporation
| : splice junction

7. EXAMPLE: lacZ TRANS-SPLICING MODEL
[00164] In order to demonstrate and evaluate the generality of the mechanism of spliceosome mediated
targeted trans-splicing between a specific pre-mRNA target and a PTM, a simple model system based on
expression of the enzyme β-galactosidase was developed. The following section describes results demonstrating
successful spliceosome mediated targeted trans-splicing between a specific target and a PTM.



7.1. MATERIALS AND METHODS 

7.1.1. PRIMER SEQUENCES

[00165] The following primers were used for testing the lacZ model system:

5' Lac-1F          GCATGAATTCGGTACCATGGGGGGGTTCTCATCATCATC
5' Lac-1R          CTGAGGATCCTCTTACCTGTAAACGCCCATACTGAC
3' Lac-1F          GCATGGTAACCCTGCAGGGCGGCTTCGTCTGGGACTGG
3' Lac-1R          CTGAAAGCTTGTTAACTTATTATTTTTGACACCAGACC
3' Lac-Stop        GCATGGTAACCCTGCAGGGCGGCTTCGTCTAATAATGGGACTGGGTG
HCG-In1F           GCATGGATCCTCCGGAGGGCCCCTGGGCACCTTCCAC
HCG-In1R           CTGACTGCAGGGTAACCGGACAAGGACACTGCTTCACC
HCG-Ex2F           GCATGGTAACCCTGCAGGGGCTGCTGCTGTTGCTG
HCG-Ex2R           CTGAAAGCTTGTTAACCAGCTCACCATGGTGGGGCAG
Lac-TR1 (Biotin)   5'-GGCTTTCGCTACCTGGAGAGAC
Lac-TR2            GCTGGATGCGGCGTGCGGTCG
HCG-R2             CGGCACCGTGGCCGAAGTGG



7.1.2. CONSTRUCTION OF THE lacZ PRE-mRNA TARGET MOLECULE
[00166] The lacZ target 1 pre-mRNA (pc3.1 lacT1) was constructed by cloning of the
following three PCR products: (i) the 5' fragment of lacZ, followed by (ii) βHCG6 intron
1, and (iii) the 3' fragment of lacZ. The 5' and 3' fragments of the lacZ gene were PCR
amplified from template pcDNA3.1/His/lacZ (Invitrogen, San Diego, CA) using the
following primers: 5' Lac-1F and 5' Lac-1R (for the 5' fragment), and 3' Lac-1F and 3' Lac-1R
(for the 3' fragment). The amplified lacZ 5' fragment is 1788 bp long which includes the
initiation codon, and the amplified 3' fragment is 1385 bp long and has the natural 5' and 3'
splice sites in addition to a branch point, polypyrimidine tract and βHCG6 intron 1. The
βHCG6 intron 1 was PCR amplified using the following primers: HCG-In1F and HCG-
In1R.

[00167] The lacZ target 2 is an identical version of lacZ target 1 except it contains two
stop codons (TAA TAA) in frame four codons after the 3' splice site. This was created by
PCR amplification of the 3' fragment (lacZ) using the following primers: 3' Lac-Stop and 3'
Lac-1R and replacing the functional 3' fragment in lacZ target 1.
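
The placement of the two stop codons can be checked directly from the primer sequences listed in Section 7.1.1. The following Python sketch is illustrative only and assumes that the reading frame of the 3' lacZ fragment begins immediately after the CTGCAG (PstI) site in each primer, i.e., that the AG of that site is the splice acceptor; the helper names `codons_after_site` and `first_stop` are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Primer sequences as listed in Section 7.1.1.
LAC_1F = "GCATGGTAACCCTGCAGGGCGGCTTCGTCTGGGACTGG"
LAC_STOP = "GCATGGTAACCCTGCAGGGCGGCTTCGTCTAATAATGGGACTGGGTG"


def codons_after_site(primer, site="CTGCAG"):
    """Split the sequence following `site` into complete codons.

    Assumption: translation of the 3' fragment is in frame with the
    first nucleotide after the PstI site.
    """
    start = primer.index(site) + len(site)
    tail = primer[start:]
    usable = len(tail) - len(tail) % 3  # drop any trailing partial codon
    return [tail[i:i + 3] for i in range(0, usable, 3)]


def first_stop(codons):
    """Return the zero-based index of the first stop codon, or None."""
    for index, codon in enumerate(codons):
        if codon in STOP_CODONS:
            return index
    return None


for name, primer in (("3' Lac-1F", LAC_1F), ("3' Lac-Stop", LAC_STOP)):
    codons = codons_after_site(primer)
    print(name, codons, "first stop at codon index:", first_stop(codons))
```

Under the stated assumption, the 3' Lac-Stop primer carries four sense codons followed by TAA TAA, while the 3' Lac-1F primer carries no stop codon in the same frame.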

7.1.3. CONSTRUCTION OF pc3.1 PTM1 and pc3.1 PTM2 
[00168] The pre-trans-splicing molecule, pc3.1 PTM1, was created by digesting pPTM+Sp
with PstI and HindIII and replacing the DNA fragment encoding the DT-A toxin with
a DNA fragment encoding the functional 3' end of lacZ. This fragment was generated
by PCR amplification using the following primers: 3' Lac-1F and 3' Lac-1R. For cell culture
experiments, an EcoRI and HindIII fragment of pc3.1 PTM1, which contains the binding
domain to HCG intron 1, a 30 bp spacer, a yeast branch point (TACTAAC), and a strong
polypyrimidine tract followed by the cloned lacZ fragment, was cloned into pcDNA3.1.
[00169] The pre-trans-splicing molecule, pc3.1 PTM2, was created by digesting pPTM+Sp
with PstI and HindIII and replacing the DNA fragment encoding the DT-A toxin with
the βHCG6 exon 2. βHCG6 exon 2 was generated by PCR amplification using the following
primers: HCG-Ex2F and HCG-Ex2R. For cell culture experiments, an EcoRI and HindIII
fragment of pc3.1 PTM2, which contains the binding domain to HCG intron 1, a 30 bp
spacer, a yeast branch point (TACTAAC), and a strong polypyrimidine tract followed by the
cloned βHCG6 exon 2, was used.

7.1.4. CO-TRANSFECTION OF THE lacZ SPLICE TARGET
PRE-mRNA AND PTMS INTO 293T CELLS

[00170] Human embryonic kidney cells (293T) were grown in DMEM medium
supplemented with 10% FBS at 37°C in 5% CO2. Cells were co-transfected with pc3.1
LacT1 and pc3.1 PTM2, or pc3.1 LacT2 and pc3.1 PTM1, using Lipofectamine Plus (Life
Technologies, Gaithersburg, MD) according to the manufacturer's instructions. 24 hours
post-transfection, the cells were harvested; total RNA was isolated and RT-PCR was
performed using specific primers for the target and PTM molecules. β-galactosidase activity
was also monitored by staining the cells using a β-gal staining kit (Invitrogen, San Diego,
CA).

7.2. RESULTS 

7.2.1. THE lacZ SPLICE TARGET CIS-SPLICES EFFICIENTLY TO
PRODUCE FUNCTIONAL β-GALACTOSIDASE

[00171] To test the ability of the splice target pre-mRNA to cis-splice efficiently, pc3.1
lacT1 was transfected into 293T cells using Lipofectamine Plus reagent (Life
Technologies, Gaithersburg, MD) followed by RT-PCR analysis of total RNA. Sequence
analysis of the cis-spliced RT-PCR product indicated that splicing was accurate and occurred
exactly at the predicted splice sites (Fig. 12B). In addition, accurate cis-splicing of the target
pre-mRNA molecule results in formation of an mRNA capable of encoding active β-
galactosidase, which catalyzes the hydrolysis of β-galactosides such as X-gal, producing a blue
color that can be visualized under a microscope. Accurate cis-splicing of the target pre-
mRNA was further confirmed by successfully detecting β-galactosidase enzyme activity.
[00172] Repair of defective lacZ target 2 pre-mRNA by trans-splicing of the functional
3' lacZ fragment (PTM1) was measured by staining for β-galactosidase enzyme activity. For
this purpose, 293T cells were co-transfected with lacZ target 2 pre-mRNA (containing a
defective 3' fragment) and PTM1 (containing the normal 3' lacZ sequence). 48 hours post-
transfection, cells were assayed for β-galactosidase enzyme activity. Efficient trans-splicing
of PTM1 into the lacZ target 2 pre-mRNA will result in the production of functional β-
galactosidase activity. As demonstrated in Figure 11B-E, trans-splicing of PTM1 into lacZ
target 2 results in restoration of β-galactosidase enzyme activity up to 5% to 10% compared
to control.

7.2.2. TARGETED TRANS-SPLICING BETWEEN
THE lacZ TARGET PRE-mRNA AND PTM2

[00173] To assay for trans-splicing, lacZ target pre-mRNA and PTM2 were transfected
into 293T cells. Following transfection, total RNA was analyzed using RT-PCR. The
following primers were used in the PCR reactions: lacZ-TR1 (lacZ 5' exon specific) and
HCG-R2 (βHCG6 exon 2 specific). The RT-PCR reaction produced the expected 195 bp
trans-spliced product (Fig. 11, lanes 2 and 3), demonstrating efficient trans-splicing between
the lacZ target pre-mRNA and PTM2. Lane 1 represents the control, which does not contain
PTM2.

[00174] The efficiency of the trans-splicing was also measured by staining for β-
galactosidase enzyme activity. To assay for trans-splicing, 293T cells were co-transfected
with lacZ target pre-mRNA and PTM2. 24 hours post-transfection, cells were assayed for
β-galactosidase activity. If there is efficient trans-splicing between the target pre-mRNA and
the PTM, a chimeric mRNA consisting of the 5' fragment of the lacZ target pre-
mRNA and βHCG6 exon 2 is formed, which is incapable of coding for an active β-
galactosidase. Results from the co-transfection experiments demonstrated that trans-
splicing of PTM2 into lacZ target 1 resulted in a reduction of β-galactosidase activity
compared to the control.

[00175] To further confirm that trans-splicing between the lacZ target pre-mRNA and
PTM2 is accurate, RT-PCR was performed using 5' biotinylated lacZ-TR1 and
non-biotinylated HCG-R2 primers. Single stranded DNA was isolated and sequenced directly
using the HCG-R2 primer (HCG exon 2 specific primer). As evidenced by the sequence of the
splice junction, trans-splicing occurred exactly as predicted between the splice sites (Fig.
12A and 12B), confirming that a conventional pre-mRNA can be invaded by an engineered
PTM during splicing, and moreover, that this reaction is precise.

8. EXAMPLE: CORRECTION OF THE CYSTIC FIBROSIS
TRANSMEMBRANE REGULATOR GENE

[00176] Cystic fibrosis (CF) is one of the most common genetic diseases in the world.
The gene associated with CF has been isolated and its protein product deduced (Kerem, B.S.
et al., 1989, Science 245:1073-1080; Riordan et al., 1989, Science 245:1066-1073; Rommens
et al., 1989, Science 245:1059-1065). The protein product of the CF associated gene is
referred to as the cystic fibrosis trans-membrane conductance regulator (CFTR). The most
common disease-causing mutation, which accounts for ~70% of all mutant alleles, is a
deletion of three nucleotides in exon 10 that encode a phenylalanine at position 508
(ΔF508). The following section describes the successful repair of the cystic fibrosis gene
using spliceosome mediated trans-splicing and demonstrates the feasibility of repairing
CFTR in a model system.

8.1. MATERIALS AND METHODS
8.1.1. PRE-TRANS-SPLICING MOLECULE
[00177] The CFTR pre-trans-splicing molecule (PTM) consists of a 23 nucleotide
binding domain complementary to CFTR intron 9 (3' end, -13 to -31), a 30 nucleotide spacer
region (to allow efficient binding of spliceosomal components), a branch point (BP) sequence,
a polypyrimidine tract (PPT) and an AG dinucleotide at the 3' splice site immediately upstream
of the sequence encoding CFTR exon 10 (wild type sequence containing F508). This initial
PTM was designed for maximal activity in order to demonstrate trans-splicing; therefore the
PTM included a UACUAAC yeast consensus BP sequence and an extensive PPT. An 18
nucleotide HIS tag (6 histidine codons) was included after the wild type exon 10 coding
sequence to allow specific amplification and isolation of the trans-spliced products and not
the endogenous CFTR. The oligonucleotides used to generate the two fragments included
unique restriction sites (ApaI and PstI, and PstI and NotI, respectively) to facilitate directed
cloning of amplified DNA into the mammalian expression vector pcDNA3.1.

8.1.2. THE TARGET CFTR PRE-mRNA MINI-GENE
[00178] The CFTR mini-gene target is shown in Figure 13 and consists of CFTR exon
9; the functional 5' and 3' regions of intron 9 (260 and 265 nucleotides from each end,
respectively); exon 10 [ΔF508]; and the 5' region of intron 10 (96 nucleotides). In addition,
as depicted in Figure 16, a mini-target gene comprising CFTR exons 1-9 and 10-24 can be
used to test the use of spliceosome mediated trans-splicing for correction of the cystic
fibrosis mutation. Figure 17 shows a double splicing PTM that may also be used for
correction of the cystic fibrosis mutation. As shown, the double splicing PTM contains
CFTR BD intron 9, a spacer, a branch point, a polypyrimidine tract, a 3' splice site, CFTR
exon 10, a spacer, a branch point, a polypyrimidine tract, a 5' splice site and CFTR BD exon
10.



8.1.3. OLIGONUCLEOTIDES
[00179] The following oligonucleotides were used to create CFTR PTM:

Forward CF3
ACCT GGGCCC ACC CAT TAT TAG GTC ATT AT CCGCGG AAC ATT ATA
ApaI site. Intron 9 CFTR, -12 to -34.

Reverse CF4
ACCT CTGCAGGTGACC CTG CAG GAA AAA AAA GAA G
PstI. BstEI. PPT.

Forward CF5
ACCT CTGCAG ACTTCACTTCTAATGATGAT
PstI. Exon 10 CFTR, +1 to +24.

Reverse CF6
ACCT GCGGCCGC CTA ATG ATG ATG ATG ATG ATG CTC TTC TAG TTG GCA TGC
NotI. Stop. Polyhistidine tag. Exon 10 CFTR, +15 to +132.

[00180] The following oligonucleotides were used to create the CFTR TARGET pre-mRNA
mini gene (Exon 9 + mini-Intron 9 + Exon 10 + 5' end Intron 10):

Forward CF18
GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG
XhoI. Exon 9 CFTR, 1 to 21.

Reverse CF19
CTGACCT GCGGCCGC TAC AGT GTT GAA TGT GGT GC
NotI. Intron 9 5' end.

Forward CF20
CTGACCT GCGGCCGC CCA ACT ATC TGA ATC ATG TG
NotI. Intron 9 3' end.

Reverse CF21
GACCT CTTAAG TAG ACT AAC CGA TTG AAT ATG
AflII. Intron 10 5' end.

[00181] The following oligonucleotides were used for detection of trans-spliced products:

Reverse Bio-His
CTA ATG ATG ATG ATG ATG ATG
Stop. Polyhistidine tag (5' biotin label).

Reverse Bio-His(2)
CGC CTA ATG ATG ATG ATG ATG
3' UT. Stop. Polyhistidine tag (5' biotin label).

Forward CF8
CTT CTT GGT ACT CCT GTC CTG
Exon 9 CFTR.

Forward CF18
GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG
XhoI. Exon 9 CFTR.

Reverse CF28
AAC TAG AAG GCA CAG TCG AGG
pc3.1 vector sequence (present in PTM 3' UT but not target).
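
Because the detection primers are written as reverse (antisense) oligonucleotides, the tag they encode is easier to read after reverse complementing. The following minimal Python sketch, which is illustrative and not part of the reported methods, converts the Reverse Bio-His primer to its sense strand and splits it into codons; the helper name `reverse_complement` and the two-entry codon lookup are assumptions introduced here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Only the codons relevant to this primer are listed (CAT = His, TAG = stop).
CODON_MEANING = {"CAT": "His", "TAG": "Stop"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (spaces ignored)."""
    return seq.replace(" ", "").translate(COMPLEMENT)[::-1]


# Reverse Bio-His primer as listed above.
BIO_HIS = "CTA ATG ATG ATG ATG ATG ATG"

sense = reverse_complement(BIO_HIS)
codons = [sense[i:i + 3] for i in range(0, len(sense), 3)]
decoded = [CODON_MEANING.get(codon, "?") for codon in codons]

print(sense)
print(codons)
print(decoded)
```

Read on the sense strand, the primer corresponds to six CAT (histidine) codons followed by a TAG stop codon, in agreement with the 18 nucleotide HIS tag described in Section 8.1.1.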

8.2. RESULTS 

[00182] The PTM and target pre-mRNA were co-transfected in 293 embryonic kidney cells using
Lipofectamine (Life Technologies, Gaithersburg, MD). Cells were harvested 24 h post transfection and
RNA was isolated. Using PTM and target-specific primers in RT-PCR reactions, a trans-spliced
product was detected in which mutant exon 10 of the target pre-mRNA was replaced by the wild type
exon 10 of the PTM (Figure 14). Sequence analysis of the trans-spliced product confirmed the
restoration of the three nucleotide deletion and that splicing was accurate, occurring at the predicted
splice sites (Figure 15), demonstrating for the first time RNA repair of the cystic fibrosis gene, CFTR
(Mansfield et al., 2000, Gene Therapy 7:1885-1895).

9. EXAMPLE: DOUBLE-TRANS-SPLICING
[00183] The following example demonstrates accurate replacement of an internal exon
by double-trans-splicing between a target pre-mRNA and a PTM RNA containing both 3'
and 5' splice sites, leading to production of full length functionally active protein.
[00184] As described herein, any pre-mRNA can be reprogrammed by providing a
trans-reactive RNA molecule containing either a 3' splice site, a 5' splice site or both. The
following example describes successful targeting and replacement of a single internal exon
utilizing pre-trans-splicing molecules (PTMs) containing both the 5' and 3' splice sites. Such
PTMs can promote two trans-splicing reactions with the intended target gene mediated by
the spliceosome(s). To test this mechanism, a splicing lacZ model target gene consisting of
lacZ 5' "exon" - CFTR mini-intron 9 - CFTR exon 10 (ΔF508) - CFTR mini-intron 10
followed by lacZ 3' "exon" was created. In this target transcript, a 124 bp central portion of
the β-galactosidase ORF was substituted by exon 10 (ΔF508) of CFTR; thus it produces non-
functional protein. A PTM consisting of the missing 124 bp lacZ "mini-exon" and a 5' and
3' trans-splicing domain containing binding domains (BDs) complementary to the target
introns and exons was created. Transfection of HEK 293T cells with either target alone or
PTM alone showed no detectable levels of β-gal activity. In contrast, 293T cells transfected
with target plus PTM produced substantial levels of β-gal activity, indicating the restoration
of protein function. The accuracy of trans-splicing between the target and PTM was
confirmed by sequencing the appropriate RT-PCR product, which revealed the predicted
internal exon substitution. The feasibility of this approach in a disease model was tested by
replacing the CFTR ΔF508 exon 10 with normal exon 10 containing F508 in cystic fibrosis.
These results demonstrate that trans-splicing technology can be easily adapted to correct
many genetic defects, whether they are associated with the 5' exon, the 3' exon or any
internal exon of the gene.

[00185] Figure 18 is a schematic of a model lacZ target consisting of lacZ 5' exon -
CFTR mini-intron 9 - CFTR exon 10 (delta 508) - CFTR mini-intron 10 followed by the lacZ
3' exon. In this target, a 124 bp central portion of the lacZ gene is substituted with CFTR
exon 10 which has a mutation at position 508 (delta 508). The pre-mRNA target undergoes
normal cis-splicing to produce an mRNA consisting of lacZ 5' exon - CFTR exon 10 (delta
508) followed by the lacZ 3' exon. Because of the disruption in the β-galactosidase ORF it
produces truncated proteins which are non-functional.

[00186] To restore β-gal function by double-trans-splicing, three PTMs were created
consisting of the missing 124 bp lacZ "mini-exon" and a 5' and 3' trans-splicing domain
containing binding domains complementary to the target introns and exons as shown in
Figure 19. These PTMs have a 120 bp 3' binding domain (complementary to intron 9) from
PTM24 (see below) used in 3' exon replacement, spacer sequence, yeast branch point,
polypyrimidine tract, 3' acceptor AG dinucleotide, lacZ "mini-exon", 5' splice site, and spacer
sequence followed by the 5' binding domain. These PTMs differ only in their 5' binding
domain sequences. DSPTM5 has a 27 bp BD which is complementary to intron 10 and
blocks just the 5' splice site of the target. DSPTM6 has a 120 bp 5' BD and covers both the 5' and
3' splice sites of the target, while DSPTM7 has a 260 bp BD which masks both splice sites
(5' and 3') and also covers the entire exon of the target.

[00187] A schematic representation of a double-trans-splicing reaction showing the
binding of DSPTM7 to the DSCFT1.6 target pre-mRNA is shown in Figure 20. 3' BD: 120
bp binding domain complementary to mini-intron 9; 5' BD: 260 bp second binding domain
complementary to mini-intron 10 and exon 10; ss: splice sites; BP: branch point; PPT:
polypyrimidine tract.

[00188] The important structural elements of DSPTM7 (Figure 21) are as follows: 
(1) 3' BD (120 bp):

GATTCACTTGCTCCAATTATCATCCTAAGCAGAAG 
TGTATATTCTTATTTGTAAAGATTCTATTAACTCATTTGA 
TTCAAAATATTTAAAATACTTCCTGTTTCATACTCTGCTA 
TGCAC 

(2) Spacer sequences (24 bp) : AACATTATTATAACGTTGCTCGAA 

(3) Branch point, polypyrimidine tract and acceptor splice site:

BP        KpnI   PPT             EcoRV  3' ss    lacZ mini-exon

TACTAAC T GGTACC TCTTCTTTTTTTTTT GATATC CTGCAG | GGC GGC

(4) 5' donor site and 2nd spacer sequence:

lacZ mini-exon | 5' ss

        TGAACG | GTAAGT GTTATCACCGATATGTGTCTAACCTGATTCGGGCCTTC
GATACGCTAAGATCCACCGG

(5) 5' BD (260 bp):

TCAAAAAGTTTTCACATAATTTCTTACCTCTTCTTG 
AATTCATGCTTTGATGACGCTTCTGTATCTATATTCATCA 
TTGGAAACACCAATGATTTTTCTTTAATGGTGCCTGGCAT 
AATCCTGGAAAACTGATAACACAATGAAATTCTTCCACT 
GTGCTTAAAAAAACCCTCTTGAATTCTCCATTTCTCCCAT 
AATCATCATTACAACTGAACTCTGGAAATAAAACCCATC 
ATTATTAACTCATTATCAAATCACGC 
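
The element lengths stated for DSPTM7 can be cross-checked by joining the line-wrapped
sequence fragments listed above. The following is a minimal sketch only; the variable and
function names are illustrative and are not part of the specification, and the fragments
are transcribed as printed.

```python
# Sequence fragments transcribed from the DSPTM7 listing (Figure 21).
BD3_FRAGMENTS = [
    "GATTCACTTGCTCCAATTATCATCCTAAGCAGAAG",
    "TGTATATTCTTATTTGTAAAGATTCTATTAACTCATTTGA",
    "TTCAAAATATTTAAAATACTTCCTGTTTCATACTCTGCTA",
    "TGCAC",
]
SPACER = "AACATTATTATAACGTTGCTCGAA"
BD5_FRAGMENTS = [
    "TCAAAAAGTTTTCACATAATTTCTTACCTCTTCTTG",
    "AATTCATGCTTTGATGACGCTTCTGTATCTATATTCATCA",
    "TTGGAAACACCAATGATTTTTCTTTAATGGTGCCTGGCAT",
    "AATCCTGGAAAACTGATAACACAATGAAATTCTTCCACT",
    "GTGCTTAAAAAAACCCTCTTGAATTCTCCATTTCTCCCAT",
    "AATCATCATTACAACTGAACTCTGGAAATAAAACCCATC",
    "ATTATTAACTCATTATCAAATCACGC",
]

# Lengths stated in the text, in bp.
STATED = {"3' BD": 120, "spacer": 24, "5' BD": 260}


def joined(fragments):
    """Concatenate line-wrapped sequence fragments into one string."""
    return "".join(fragments)


observed = {
    "3' BD": len(joined(BD3_FRAGMENTS)),
    "spacer": len(SPACER),
    "5' BD": len(joined(BD5_FRAGMENTS)),
}

for name, stated in STATED.items():
    status = "ok" if observed[name] == stated else "MISMATCH"
    print(f"{name}: stated {stated} bp, observed {observed[name]} bp ({status})")
```

A mismatch would indicate a transcription error in one of the fragments rather than a
property of the construct itself.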

[00189] To determine whether the restoration of β-gal function is mediated by RNA
trans-splicing, mutant PTMs were constructed, as depicted in Figure 22. DSPTM8 is a 3' splice
mutant in which the 3' splice elements (BP, polypyrimidine tract and the 3' acceptor AG
dinucleotide) were deleted and replaced with random sequences. This PTM still has the 3' and
5' binding domains and the functional 5' splice site. PTM29 lacks the 2nd binding domain +
5' ss but still has the 3' binding domain + 3' splice site, while PTM30 lacks the 1st binding
domain + 3' splice site but has the functional 5' splice site and 2nd binding domain.

[00190] To examine the double-trans-splicing mediated restoration of β-gal function,
293T cells were either transfected with 2 µg of target or PTM alone or co-transfected with
2 µg of target + 1.5 µg of PTM using Lipofectamine Plus reagent. Forty-eight hours after
transfection, total RNA was isolated and analyzed by RT-PCR using K1-1F and Lac-6R
primers. These primers amplify both cis- and trans-spliced products in a single reaction, and
the products were identified by size: the cis-spliced product is 295 bp while the trans-spliced
product is 230 bp. To confirm that trans-splicing between DSPTM7 and DSCFT1.6
pre-mRNA is precise, RT-PCR amplified products were excised, re-amplified using K1-2F
and Lac-6R primers and sequenced directly using K1-2F or Lac-6R primers. As shown in
Figure 23, trans-splicing occurred exactly at the predicted splice sites, confirming the precise
internal exon substitution by two trans-splicing events.

[00191] The repair of defective lacZ pre-mRNA by double trans-splicing events and
subsequent production of full-length β-gal protein was investigated in co-transfection assays.
293T cells were co-transfected with DSCFT1.6 target and DSPTM7 expression plasmids,
as well as with DSCFT1.6 target or DSPTM7 alone as controls. Western blot analysis of
total cell lysates using polyclonal anti-β-galactosidase antiserum specifically recognized a
~120 kDa protein only in cells co-transfected with DSCFT1.6 target + DSPTM7 plasmids
(Fig. 24, lanes 3 and 4) but not in cells transfected with either DSCFT1.6 target (lane 1) or
DSPTM7 plasmid alone (lane 2). Similarly, no full-length protein was detected in cells
co-transfected with DSCFT1.6 target + 3' splice mutant (lanes 5 and 6) or PTM29 or PTM30, in
which either the 3' trans-splicing domain or the 5' trans-splicing domain has been deleted
(lane 7). In addition, the 120 kDa protein band co-migrated with the full-length functional
β-gal produced using lacZ-T1 plasmid (positive control, data not shown). These results not only
confirmed the production of full-length protein by double-trans-splicing between the target
and PTM but also demonstrated that both the 3' and 5' splice sites are essential for
this process.

[00192] To determine whether the full-length protein produced by double-trans-splicing
between the target pre-mRNA and DSPTM7 RNA is functionally active, 293T cells were
co-transfected with DSCFT1.6 target + one of the double-splicing PTM expression plasmids
(DSPTM5, 6 or 7), or transfected with DSCFT1.6 target or DSPTM7 alone. Total cell extracts
were prepared and assayed for β-gal activity using an ONPG assay (Invitrogen). β-gal activity in
extracts prepared from cells transfected with either DSCFT1.6 target or DSPTM7 alone was
almost identical to the background levels detected in mock transfection (Fig. 25). In
contrast, 293T cells co-transfected with DSCFT1.6 target and DSPTM7 produced ~21-fold
higher levels of β-gal activity over the background (Fig. 25). These results confirmed
accurate double-trans-splicing between the target pre-mRNA and PTM RNA and production
of the full-length functional protein.

[00193] To confirm that restoration of β-gal activity by the double-trans-splicing reaction
depends absolutely on the presence of both the 3' and 5' splice sites of the PTM, we
constructed several mutants: (a) DSPTM8 is identical to DSPTM7 except that the functional 3'
splice elements (branch point, polypyrimidine tract and the 3' acceptor AG dinucleotide)
were deleted and substituted with random sequences (see Fig. 22 for details); (b) PTM29
lacks the 5' splice site as well as the 5' binding domain but has the 3' binding domain and 3'
splice site; and (c) PTM30 lacks the 3' binding domain and 3' splice site but has the 5' splice
site and 5' binding domain. β-gal activity in extracts prepared from cells transfected with either
DSCFT1.6 target or DSPTM7 alone was almost identical to the background levels detected
in mock transfection (Fig. 26). Similarly, no significant increase in β-gal activity was
detected in cells transfected with DSPTM8 alone (3' splice site mutant) or co-transfected
with DSCFT1.6 target + one of the above mutant PTMs. On the other hand, cells
co-transfected with DSCFT1.6 target and DSPTM7 with functional 3' and 5' splice sites
produced substantial levels of β-gal activity over the background (Fig. 26). These results
confirmed the requirement for both splice sites in the double-splicing PTM and also
eliminated the possibility that restoration of β-gal activity was due to complementation
between the truncated proteins (Fig. 26).

[00194] Different concentrations of the target and PTM were co-transfected and
analyzed for restoration of β-gal activity. As expected, 293T cells co-transfected with
DSCFT1.6 target + DSPTM7 showed substantial levels of β-gal activity (~30-fold) over the
controls. Increasing the concentration of the PTM 2- and 3-fold did increase the level of
β-gal activity, but not significantly (Fig. 27). These results further confirmed the
double-trans-splicing mediated restoration of β-gal enzyme function.

[00195] The specificity of the double-trans-splicing reaction was examined by constructing
a non-specific target (DSHCGT1.1), which is similar to the specific target (DSCFT1.6)
but has βHCG intron 1 - βHCG exon 2 and βHCG intron 2 instead of CFTR mini-intron 9
- CFTR exon 10 (delta 508) and CFTR mini-intron 10 (Fig. 28). RT-PCR analysis of the
total RNA isolated from cells transfected with DSHCGT1.1 (non-specific target) either alone
or in combination with DSPTM7 (targeted to the DSCFT1.6 target) failed to produce the
expected 314 bp double-trans-spliced product. On the other hand, RT-PCR analysis of the
total RNA prepared from cells co-transfected with specific target + PTM produced the expected
314 bp product. This was further confirmed by β-gal activity assay of the total cellular extract.
The level of β-gal activity detected in cells transfected with the non-specific target alone or in
combination with DSPTM7 targeted to the DSCFT1.6 target was almost identical to the
background level. In contrast, substantial levels of β-gal activity were detected in cells
co-transfected with specific target (DSCFT1.6) + DSPTM7 (Fig. 27). These results confirmed
that double-trans-splicing is highly specific.

[00196] The repair model in Fig. 30 shows a portion of a target CFTR pre-mRNA
consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation,
mini-intron 10 and exons 11-24. The PTM shown in the figure consists of exon 10
coding sequences (containing codon 508) and two trans-splicing domains, each with its own
splicing elements (acceptor and donor sites, branchpoint and pyrimidine tract), and a binding
domain complementary to the intron 9 splice site, part of exon 10 (5' and 3' ends) and the
intron 10 5' splice site (Fig. 31 (DS-CF1)). Exon 10 of the PTM also has modified codon usage
throughout to reduce antisense effects between exon 10 of the PTM and its own binding
domains, which is relevant for PTMs that have binding domains complementary to exon
sequences (Fig. 31). A double-trans-splicing event between the PTM and target should
produce a repaired full-length mRNA.

[00197] Fig. 32 shows the sequence of a single PCR product showing target exon 9 
correctly spliced to PTM 20 exon 10 (with modified codons) (upper panel), codon 508 in 
exon 10 of the PTM (middle panel) and PTM exon 10 correctly spliced to target exon 11
(lower panel). The sequence of a repaired target was generated by RT-PCR followed by 
PCR. 



10. EXAMPLE: TRANS-SPLICING REPAIR OF THE
CYSTIC FIBROSIS GENE USING A PTM
THAT CAN PERFORM 5' EXON REPLACEMENT

[00198] The key advantages of using 5' exon replacement for gene repair are:

(a) it permits replacement of the 5' portion of a gene,

(b) the construct requires less sequence and space than a full-length gene construct,

(c) PTMs can be produced that lack a polyA signal, which should prevent PTM
translation, and

(d) the 5' end can be modified to increase translation.

10.1 MATERIALS AND METHODS 
10.1.1 PLASMID CONSTRUCTION
[00199] The CFTR coding sequences (exons 1-10) for PTM30 were generated by PCR
using a partial cDNA plasmid template (61160; American Type Culture Collection,
Manassas, VA). The trans-splicing domain (TSD) [including the binding domain, spacer
sequence, polypyrimidine tract (PPT), branchpoint (BP) and 3' splice site] was generated
from a PCR product (using an existing plasmid template) and by annealing oligonucleotides.
The different fragments (the TSD and coding sequences) were then cloned into pcDNA3.1(-)
using appropriate restriction sites. Oligodeoxynucleotide primers were procured from Sigma
Genosys (The Woodlands, TX). All PCR products were generated with either REDTaq
(Sigma, St. Louis, MO) or cloned Pfu (Stratagene, La Jolla, CA) DNA polymerase. PCR
primers for amplification contained restriction sites for directed cloning. PCR products were
digested with the appropriate restriction enzymes and cloned into the mammalian expression
plasmid pcDNA3.1(-) (Invitrogen, Carlsbad, CA).

10.1.2 CELL CULTURE AND TRANSFECTIONS
[00200] Constructs were cotransfected into human embryonic kidney (HEK) 293T or
293 cells (1.25 x 10^6 cells per 60 mm poly-D-lysine coated dish) using LipofectaminePlus
(Life Technologies, Gaithersburg, MD) and the cells were harvested 48 h after the start of
transfection. Total RNA was isolated as described in the manufacturer's instructions
(Epicenter Technologies, Inc.). HEK 293T cells were grown in Dulbecco's Modified Eagle's
Medium (Life Technologies) supplemented with 10% v/v fetal bovine serum (Hyclone, Inc.,
Logan, UT). All cells were kept in a humidified incubator at 37 °C and 5% CO2.

10.1.3 REVERSE TRANSCRIPTION-POLYMERASE CHAIN
REACTION (RT-PCR)

[00201] RT-PCR was performed using an EZ-RT-PCR kit (Perkin-Elmer, Foster City, CA).
Each reaction contained 0.03 to 1.0 µg of total RNA and 80 ng of a 5' and 3' specific primer
in a 40 µl reaction volume. RT-PCR products were electrophoresed on 2% SeaKem agarose
gels. The PTM- and target-specific oligonucleotides used to generate trans-spliced products
were 5'-CGCTGGAAAAACGAGCTTGTTG-3' (primer CF93) and
5'-ACTCAGTGTGATTCCACCTTCTC-3' (primer CF111), respectively. The PTM- and
target-specific oligonucleotides used to generate cis-spliced products were CF1 and CF93.
The sequence of oligonucleotide CF1 is
5'-GACCTCTGCAGACTTCACTTCTAATGATGATTATGG-3'.
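
As an aid to checking the primer sequences given above, basic properties such as length
and GC content can be tabulated. The sketch below is illustrative only: the helper
`gc_percent` and the `summary` table are names introduced here, and no claim is made that
such a check was part of the described method.

```python
# Primer sequences as given in the text (5' to 3').
PRIMERS = {
    "CF93": "CGCTGGAAAAACGAGCTTGTTG",
    "CF111": "ACTCAGTGTGATTCCACCTTCTC",
    "CF1": "GACCTCTGCAGACTTCACTTCTAATGATGATTATGG",
}


def gc_percent(seq):
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Map each primer name to (length in nt, GC content rounded to 0.1%).
summary = {
    name: (len(seq), round(gc_percent(seq), 1)) for name, seq in PRIMERS.items()
}

for name, (length, gc) in summary.items():
    print(f"{name}: {length} nt, {gc}% GC")
```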

[00202] The repair model in Fig. 33 shows a portion of a target CFTR pre-mRNA
consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation,
mini-intron 10 and exons 11-24. The PTM shown in the figure consists of exon 1-10
coding sequences (containing codon 508) and a trans-splicing domain with its own splicing
elements (donor site, branchpoint and pyrimidine tract) and a binding domain. Several
PTMs have been constructed with different binding domains. Three examples are shown in
Figure 34. In Fig. 34A the binding domain is complementary to the splice site of intron 9
and part of exon 10 (3' end; CF-PTM 11). In Fig. 34B the PTM has an extended binding
domain which also covers the 5' end of exon 10 and the 3' splice site of intron 9 (CF-PTM
20). In the last example (Fig. 34C) the binding domain is the same as that shown in panel
B except that it extends the full length of exon 10 (CF-PTM 30). In the latter
case the PTM exon 10 has modified codon usage to reduce antisense effects with its own
binding domain (Fig. 34). Further examples of binding domains are shown in Figure 35.

[00203] Figure 36 shows the sequences of cis- and trans-spliced products. The top panel
of Fig. 36A shows target exon 10 with its three missing nucleotides (CTT), while the lower
panel shows exons 10 and 11 of the target correctly spliced together. Figure 36B is a partial
sequence of a single PCR product showing the modified codons in exon 10 of the PTM
(upper panel), codon 508 in exon 10 of the PTM (middle panel), and PTM exon 10 correctly
spliced to target exon 11 (lower panel), indicating that trans-splicing is accurate. The
sequence of the repaired target was generated by RT-PCR followed by PCR.

11. EXAMPLE: PTMs WITH A LONG BINDING
DOMAIN, WHICH MAY BE DISCONTINUOUS,
HAVE INCREASED TRANS-SPLICING
EFFICIENCY AND SPECIFICITY

11.1. MATERIALS AND METHODS 

11.1.1. CELL CULTURE 

[00204] Human embryonic kidney cells (293 or 293T) were from the University of
North Carolina tissue culture facility at Chapel Hill (Chapel Hill, NC). Cells were
maintained at 37 °C in a humidified incubator with 5% CO2 in Dulbecco's modified Eagle's
medium (Life Technologies, Bethesda, MD) supplemented with 10% v/v fetal bovine serum
(Hyclone, Logan, UT). Cells were passaged every 2-3 days using 0.5% trypsin and re-plated
at the desired density. Stable cells expressing an endogenous mutant lacZ pre-mRNA
(lacZCF9) were maintained in the presence of 0.5 mg/ml G418 (Calbiochem, San Diego,
CA).

11.1.2. RECOMBINANT PLASMIDS
[00205] Targets: pc3.1lacZCF9, pc3.1lacZCF9m, and pc3.1lacZHCG1m.
pc3.1lacZCF9, which encodes a normal lacZ pre-mRNA, was constructed using lacZ coding
sequences nucleotides 1-1788 as the 5' exon, CFTR mini-intron 9, and lacZ coding
sequences nucleotides 1789-3174 as the 3' exon. It is similar to the pc3.1lacZ-T2 construct but
lacks stop codons in the lacZ 3' exon and has CFTR mini-intron 9 instead of βHCG6
intron 1 (Fig. 37A). CFTR mini-intron 9 was PCR amplified using plasmid T5 as template
and primers CFIN-9F (5'-CTAGGATCCCGTTCTTTTGTTCTTCACTATTAA) and CFIN-9R
(5'-CTAGGGTTACCGAAGTAAAACCATAC1TTATTAG; restriction sites underlined),
digested with BamH I and BstE II and cloned in place of βHCG6 intron 1 of the pc3.1lacZ-T2
plasmid. pc3.1lacZCF9m expresses a defective lacZ pre-mRNA and is identical to
pc3.1lacZCF9 but contains two in-frame nonsense codons in the 3' exon (Fig. 37A).
pc3.1lacZHCG1m is a chimeric target, which includes the lacZ 5' exon followed by intron
1 and exon 2 of βHCG6. It is similar to pc3.1lacZCF9m except that it contains exon 2
of βHCG6 in place of the mutant lacZ 3' exon. βHCG6 exon 2 was PCR amplified using
βHCG6 plasmid (accession # X00266) as template DNA and primers HCGEx-2F (5'-
GCATGGHACCCTGCAGGGGCTGCTGCTGTTGCTG) and HCGEx-2R (5'-
CTGAAAGCTTGTTAACCAGCTCACCATGGTGGGGCAG; restriction sites underlined),
digested with BstE II and Hind III and cloned in place of the lacZ 3' exon of pc3.1lacZCF9m.
Plasmid pcDNA3.1/HisB/lacZ (Invitrogen, Carlsbad, CA) was used as DNA template to
produce the 5' and 3' lacZ exons. The lacZ 5' exon is 1788 bp long and has an ATG initiation
codon; the lacZ 3' exon (without stop codons) is 1385 bp long and has a transcription
termination signal at the end of the 3' exon. CFTR mini-intron 9 and βHCG6 intron 1 are
548 bp and 352 bp in size, respectively, and both have 5' and 3' splice signals. Exon 2 of
βHCG6 is 162 bp long and has a transcription termination signal at the end of the exon.

[00206] Pre-trans-splicing Molecules (PTMs): PTM-CF14 is identical to
pcPTM1 apart from minor modifications in the trans-splicing domain (Fig. 37B). PTM-CF14 is
a linear version and contains a 23 bp antisense binding domain (BD)
(5'-ACCCATCATTATTAGGTCATTAT) complementary to CFTR mini-intron 9, an 18 bp
spacer, a canonical branch point sequence (UACUAAC; BP) and an extended
polypyrimidine tract (PPT) followed by the normal lacZ 3' exon. PTM-CF22, PTM-CF24,
PTM-CF26 and PTM-CF27 are identical to PTM-CF14 except that they differ in the length of
the BD (Fig. 37B). sPTM-CF18 has a 32 bp BD; sPTM-CF22 and sPTM-CF24 contain the same
BDs as PTM-CF22 and PTM-CF24, respectively. In these PTMs, the binding domains were
modified to create an intra-molecular stem-loop structure ("safety") to mask the 3' splice site
of the PTM. Different binding domains were produced by PCR amplification using specific
primers (with unique Nhe I and Sac II sites) and a plasmid containing CFTR mini-intron 9
as template. PCR products were digested with Nhe I and Sac II and cloned into a PTM
plasmid consisting of spacer sequences, 3' splice elements (BP, PPT and acceptor AG
dinucleotide) followed by a normal lacZ 3' exon.
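
Because the binding domain is antisense to the target, the intron sequence it is expected
to pair with can be obtained as its reverse complement. The following minimal sketch applies
this to the 23 nt PTM-CF14 binding domain quoted above; the function name
`reverse_complement` is introduced here for illustration, and DNA letters are used
throughout as in the text.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# PTM-CF14 binding domain as given in the text (5' to 3').
PTM_CF14_BD = "ACCCATCATTATTAGGTCATTAT"

# Sense-strand sequence that this binding domain would be expected to pair with.
target_site = reverse_complement(PTM_CF14_BD)
print(f"BD:     5'-{PTM_CF14_BD}-3' ({len(PTM_CF14_BD)} nt)")
print(f"target: 5'-{target_site}-3'")
```

Applying the same function twice returns the original binding domain, which is a simple
sanity check on the transcription.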

11.1.3. TRANSFECTION OF PLASMID DNAs INTO 293T CELLS 
[00207] The day before transfection, 1 x 10^6 293T cells were plated on 60 mm plates
coated with poly-D-lysine (Sigma, St. Louis, MO) to enhance the adherence of cells and
grown for 24 hr at 37 °C. Cells were transfected with expression plasmids using
LipofectaminePlus reagent according to standard protocols (Life Technologies, Bethesda,
MD). In a typical co-transfection, 2 µg of pc3.1lacZCF9m target and 1.5 µg of PTM
expression plasmids were transfected into cells; for controls (target alone and PTM alone
transfections) the total DNA amount was normalized to 3.5 µg with pcDNA3.1 vector.
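
The normalization of total DNA can be illustrated with a short calculation. The sketch
below assumes that each control uses the same amount of target or PTM plasmid as the
co-transfection (2 µg and 1.5 µg, respectively), which the text implies but does not state
outright; the names `filler_ug` and `conditions` are illustrative.

```python
TOTAL_DNA_UG = 3.5  # total DNA per transfection, as stated in the text


def filler_ug(plasmids_ug, total_ug=TOTAL_DNA_UG):
    """Return the amount of empty pcDNA3.1 vector needed to reach total_ug."""
    used = sum(plasmids_ug.values())
    if used > total_ug:
        raise ValueError("plasmid amounts exceed the total DNA amount")
    return round(total_ug - used, 3)


# Assumed plasmid amounts (µg) for each condition.
conditions = {
    "target + PTM": {"target": 2.0, "PTM": 1.5},
    "target alone": {"target": 2.0},
    "PTM alone": {"PTM": 1.5},
}

filler = {name: filler_ug(amounts) for name, amounts in conditions.items()}
for name, amount in filler.items():
    print(f"{name}: add {amount} µg pcDNA3.1 vector")
```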
[00208] Forty-eight hours after transfection the plates were rinsed with PBS, the cells were
harvested and total RNA or DNA was isolated using a MasterPure RNA/DNA purification kit
(Epicenter Technologies, Madison, WI). Contaminating DNA in the RNA preparation was
removed by treating with DNase I, while contaminating RNA in the DNA preparation was
removed by digesting with RNase A at 37 °C for 30-45 min.

11.1.4. REVERSE TRANSCRIPTION-POLYMERASE
CHAIN REACTION (RT-PCR)

[00209] RT-PCR was performed as suggested by the manufacturer using an EZ rTth RNA
PCR kit (Perkin-Elmer, Foster City, CA). A typical reaction (50 µl) contained 25-500 ng
of total RNA, 100 ng of 5' target specific primer (common to cis- and trans-spliced products)
(Lac-9F, 5'-GATCAAATCTGTCGATCCTTCC) and 100 ng of 3' primer (Lac-3R,
5'-CTGATCCACCCAGTCCCATTA, target specific primer for cis-splicing, and Lac-5R,
5'-GACTGATCCACCCAGTCCCAGA, PTM specific primer for trans-splicing), 1X reverse
transcription buffer (100 mM Tris-HCl, pH 8.3, 900 mM KCl with 1 mM MnCl2), 200
dNTPs and 10 units of rTth DNA polymerase. RT reactions were performed at 60 °C for 45
min, followed by 30 sec pre-heating at 94 °C and 25-35 cycles of PCR amplification at 94 °C
for 18 sec, annealing and extension at 60 °C for 1 min, followed by a final extension at 70 °C
for 7 min. The reaction products were analyzed by agarose gel electrophoresis.
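
The cycling program described above can be written down as data and its total hold time
estimated. This is a sketch under stated assumptions: it counts only the programmed hold
times and ignores instrument ramp times, and the constant and function names are
illustrative rather than taken from any kit documentation.

```python
# Step durations taken from the protocol text, in seconds.
RT_STEP = 45 * 60         # reverse transcription at 60 °C
PREHEAT = 30              # pre-heating at 94 °C
DENATURE = 18             # per cycle, 94 °C
ANNEAL_EXTEND = 60        # per cycle, 60 °C
FINAL_EXTENSION = 7 * 60  # final extension at 70 °C


def hold_time_minutes(cycles):
    """Return total programmed hold time in minutes for a given cycle count."""
    if not 25 <= cycles <= 35:
        raise ValueError("the text describes 25-35 cycles")
    total = RT_STEP + PREHEAT + cycles * (DENATURE + ANNEAL_EXTEND) + FINAL_EXTENSION
    return total / 60


for n in (25, 30, 35):
    print(f"{n} cycles: {hold_time_minutes(n):.1f} min")
```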

11.1.5. PROTEIN PREPARATION AND β-GAL ASSAY
[00210] Total cellular protein from cells transfected with expression plasmids was
isolated by the freeze-thaw method and assayed for β-galactosidase activity using a β-gal assay
kit (Invitrogen, Carlsbad, CA). Protein concentration was measured by the dye-binding
assay using Bio-Rad protein assay reagents (BIO-RAD, Hercules, CA).

11.1.6. WESTERN BLOT 
[00211] About 5-25 µg of total protein was electrophoresed on a 7.5% SDS-PAGE gel
and electroblotted onto PVDF-P membrane (Millipore). After blocking for 1 hr at room
temperature (blocking buffer: 5% dry milk and 0.1% Tween-20 in 1X PBS), the blot was
incubated with a 1:2500 dilution of polyclonal rabbit anti-β-galactosidase antibody
(Research Diagnostics Inc., NJ) for 1 hr at room temperature, washed 3x with blocking buffer
and then incubated with a 1:5000 dilution of anti-rabbit HRP-conjugated secondary antibody.
After incubating at room temperature for 1 hr, the blot was washed 3x in blocking buffer and
developed using ECLPlus Western blotting reagents (Amersham Pharmacia Biotech, Piscataway, NJ).

11.1.7. IN SITU β-GAL STAINING
[00212] Cells were monitored for the expression of functional β-galactosidase using a
β-gal staining kit (Invitrogen, Carlsbad, CA). The percentage of β-gal positive cells was
determined by counting stained vs. unstained cells in 5-10 randomly selected fields.

11.1.8. SELECTION OF NEOMYCIN RESISTANT CLONES 
EXPRESSING AN ENDOGENOUS DEFECTIVE lacZ 
PRE-mRNA TARGET 

[00213] On day 1, 1 x 10^6 293 cells were plated on 60 mm plates and grown for 24 hr
at 37 °C. On day 2, the cells were transfected with 2 µg of pc3.1lacZCF9m using
LipofectaminePlus transfection reagent as described above. Forty-eight hours post-transfection,
cells were split (1:20 ratio) and grown in media containing 0.5 mg/ml G418. At the end of
2 weeks, neomycin resistant colonies were selected, pooled, expanded and maintained
constantly in the presence of G418.

11.2. RESULTS 

[00214] A model system was developed that permits facile and versatile analysis of
spliceosome mediated RNA trans-splicing in cells. The bacterial lacZ gene was split with
a truncated intron 9 from the Cystic Fibrosis Transmembrane Conductance Regulator
(CFTR) gene (Figure 37A). This split lacZ gene, when introduced into human 293T cells,
directed the synthesis of a lacZ pre-mRNA that could splice properly. The open reading
frame of the lacZ gene was mutated by insertion of two in-frame nonsense codons near the
5' end of the second exon (Figure 37A). This lacZ gene is referred to as lacZCF9m. In 293T
cells, lacZCF9m directs the synthesis of lacZCF9m pre-mRNA, which encodes a truncated
β-galactosidase (β-gal) protein that does not have enzymatic activity. Cells bearing the
lacZCF9m gene are a model system for genetic disorders caused by loss-of-function
mutations.

[00215] Pre-trans-splicing molecules (PTMs) were designed to trans-splice with
lacZCF9m pre-mRNA and repair the mutation caused by the two nonsense codons. PTMs
were constructed with binding domains spanning 23, 91 and 153 nucleotides (nt), which we
named PTM-CF14, PTM-CF22 and PTM-CF24, respectively (Figure 37B). The PTM-CF24
binding domain does not bind 153 contiguous nt in the targeted CFTR gene intron 9, but rather
creates a loop of 47 nt in the target between two regions of complementarity of 27 and 126
nt (Figure 37B). These PTMs were predicted to repair the deficiency created by lacZCF9m
(Figure 37C).
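
The arithmetic of the discontinuous PTM-CF24 binding domain can be made explicit. In the
sketch below, the class `DiscontinuousBD` and the term "target footprint" (paired
nucleotides plus the unpaired target loop between them) are illustrative constructs
introduced here, not terms from the specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DiscontinuousBD:
    """Binding domain made of paired segments separated by unpaired target loops."""

    paired_segments: tuple  # lengths (nt) of the regions of complementarity
    target_loops: tuple     # lengths (nt) of target loops between those regions

    @property
    def paired_nt(self):
        """Total number of nucleotides in the binding domain."""
        return sum(self.paired_segments)

    @property
    def target_footprint_nt(self):
        """Stretch of target covered, including looped-out nucleotides."""
        return self.paired_nt + sum(self.target_loops)


# Values for PTM-CF24 as described in the text: 27 nt and 126 nt segments, 47 nt loop.
ptm_cf24 = DiscontinuousBD(paired_segments=(27, 126), target_loops=(47,))
print(f"binding domain: {ptm_cf24.paired_nt} nt")
print(f"target footprint: {ptm_cf24.target_footprint_nt} nt")
```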

[00216] Semi-quantitative RT-PCR analysis was used to test the efficiency of
trans-splicing mediated by PTMs with long target binding domains. Repair of lacZCF9m
transcripts by trans-splicing was tested in two different ways: co-transfection of PTM and
target (lacZCF9m) plasmids, or transfection of cells that had been modified to express the
target as an endogenous pre-mRNA. Co-transfecting plasmids encoding PTMs with the
lacZCF9m plasmid provided a facile method for screening the PTMs for efficiency.
PTM-CF22 and PTM-CF24 were approximately 3-fold and 10-fold more efficient than
PTM-CF14, respectively, in a semi-quantitative RT-PCR assay, suggesting a significant
improvement in mRNA repair (Figure 38). Sequencing of the RT-PCR products showed that
trans-splicing was accurate, resulting in proper ligation of the exons from the target and
the PTM. Moreover, mutation of key cis-acting elements in the 3' splice site of the PTMs
abrogated trans-splicing. In these and all other assays described herein, controls were
carried out to rule out recombination at the DNA level. Thus, repair of the lacZCF9m
transcripts was a result of targeted RNA trans-splicing.

[00217] Transfection of PTM-CF14, -CF22 or -CF24 into 293 cells bearing an
endogenous lacZCF9m gene confirmed that the longer target binding domains provided the
PTMs with higher efficiency (Figure 38B). It should be noted that similar levels of RT-PCR
trans-splicing specific product were obtained after 30 PCR cycles for PTM-CF24 and after
35 cycles for PTM-CF14. The data therefore suggest that PTMs with long
binding domains repaired lacZCF9m transcripts at least an order of magnitude better than
previously described PTMs.
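
The link between a five-cycle offset and an order-of-magnitude difference can be sketched
numerically. The calculation assumes exponential amplification with a constant per-cycle
efficiency, which end-point RT-PCR only approximates; the function name and the 0.6
efficiency value are illustrative assumptions, not measurements from the text.

```python
def fold_difference(cycles_low, cycles_high, efficiency=1.0):
    """Starting-template ratio implied by reaching equal signal at two cycle counts.

    efficiency=1.0 corresponds to perfect doubling in every cycle.
    """
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    return (1.0 + efficiency) ** (cycles_high - cycles_low)


ideal = fold_difference(30, 35)                # perfect doubling: 2**5
conservative = fold_difference(30, 35, 0.6)    # assumed lower efficiency: 1.6**5
print(f"ideal: {ideal:.0f}-fold, at 60% efficiency: {conservative:.1f}-fold")
```

Under either assumption the implied difference exceeds ten-fold, which is consistent with
the "at least an order of magnitude" statement above.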

[00218] More than one in ten transcripts of lacZCF9m can be repaired by trans-splicing.
Quantitative real-time PCR was used to measure the fraction of lacZCF9m transcripts
repaired by PTMs with long binding domains. The co-transfection assay described above
was used in these experiments. PTM-CF14, which contains a binding domain of 23 nt, was
shown to repair between 1.2 and 1.6% of lacZCF9m RNAs in 293T cells and 2.1% of
lacZCF9m RNAs in H1299 human lung cancer cells. PTM-CF24, which has a 153 nt
long binding domain, was significantly more efficient, correcting between 12.1 and 15.2%
of lacZCF9m RNAs in 293T cells and 19.7% in H1299 cells. This in effect resulted in a
measurable reduction in the levels of lacZCF9m mRNA. These data also confirmed the
remarkable capability of this RT-PCR assay to distinguish between the product of
cis-splicing, lacZCF9m mRNA, and the product of trans-splicing, repaired lacZCF9m
mRNA. This is the first true quantification of the efficacy of trans-splicing mediated mRNA
repair at the RNA level. These data confirm the suggestions of the semi-quantitative
RT-PCR analysis shown above. Similar experiments were carried out using 293 cells that
express an endogenous lacZCF9m pre-mRNA target. Consistent with the data shown above,
PTM-CF24 was ten times more efficient than PTM-CF14, with the former correcting
between 1.3 and 4.1% of endogenous lacZCF9m transcripts. These data confirmed that
increasing the length of the PTM binding domain provided a remarkable enhancement in
trans-splicing efficiency.
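
The repair percentages reported above can be converted into fold-improvement ranges. The
sketch below uses only the figures quoted in the text; pairing the extremes of each range
to bound the ratio is an assumption made here for illustration, since the text does not
say which measurements were paired.

```python
# Percent of lacZCF9m RNAs repaired, as (low, high) ranges quoted in the text.
REPAIR_PERCENT = {
    "293T": {"PTM-CF14": (1.2, 1.6), "PTM-CF24": (12.1, 15.2)},
    "H1299": {"PTM-CF14": (2.1, 2.1), "PTM-CF24": (19.7, 19.7)},
}


def fold_range(short_bd, long_bd):
    """Return (min, max) fold improvement of the long BD over the short BD."""
    lowest = long_bd[0] / short_bd[1]   # worst long-BD value vs best short-BD value
    highest = long_bd[1] / short_bd[0]  # best long-BD value vs worst short-BD value
    return lowest, highest


folds = {
    cell: fold_range(data["PTM-CF14"], data["PTM-CF24"])
    for cell, data in REPAIR_PERCENT.items()
}
for cell, (lowest, highest) in folds.items():
    print(f"{cell}: {lowest:.1f}- to {highest:.1f}-fold")
```

The resulting ranges bracket the roughly ten-fold improvement described in the text.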

[00219] Trans-splicing mediated mRNA repair results in the synthesis of active
β-galactosidase. At the cellular level, the ultimate criterion for the success of mRNA repair
is the production of an active protein. Using a Western blot assay it was determined that
full-length β-gal was produced as a result of trans-splicing. Full-length β-gal was not observed
following transfection of 293T cells with plasmids encoding lacZCF9m or PTM-CF24 alone.
Co-transfection of both plasmids, however, resulted in robust production of full-length β-gal
protein, which was readily detectable using anti-β-gal antiserum (Figure 39). This result
complements the enzymatic activity data and suggests that the latter were not due to
complementation by truncated β-gal proteins. The Western blot analysis revealed that
full-length β-gal protein was made in 293T cells by trans-splicing and furthermore confirmed
that the PTMs with long binding domains were efficiently spliced.

[00220] Appropriate repair of β-gal mRNA and synthesis of full-length β-gal protein
should lead to the production of active enzyme. Indeed, 293T cells co-transfected with
lacZCF9m and PTM-CF24 were shown to have β-gal activity measured either in situ (Figure
40A) or in extracts (Figure 40B). This activity was shown to depend on trans-splicing
between the target pre-mRNA and the PTM. The quantitative in solution assay further
confirmed the data presented above: PTM-CF22 and PTM-CF24 were 2.9- and 9.3-fold more
efficient, respectively, than PTM-CF14. Most impressive, however, were the results using 293
cells that harbor lacZCF9m as a stable endogenous gene. When these cells were transfected
with PTM-CF14, the levels of β-gal activity obtained were barely above background.
Transfection with PTM-CF24, however, resulted in a considerable level of β-gal activity
(Figure 40C). This was paralleled by the appearance of full-length β-gal protein. These data
demonstrate a sizeable increase in the efficiency of trans-splicing to repair a mutated
pre-mRNA. In fact, in all prior reports, repair of endogenous RNA in mammalian cells by either
group I ribozymes or trans-splicing has been documented only by RT-PCR, an
indication of the low level of repair.

[00221] PTMs with very long binding domains are highly specific. It was shown that
a secondary structure within the binding domain could enhance specificity of PTMs in HeLa
nuclear extracts. In order to ascertain the specificity of the trans-splicing reactions in vivo,
a second target gene was prepared, which could serve as a reporter of non-specific reactions.
This gene, which is referred to as lacZHCG1m, shares the first exon with lacZCF9m. The
intron in lacZHCG1m is intron 1 of the β-subunit of the human chorionic gonadotropin gene
6 (βhCG6) and the second exon is exon 2 of the same gene. lacZHCG1m drives the
synthesis of a pre-mRNA that is spliced correctly to yield a chimeric mRNA that does not
encode a full-length β-gal (see below). PTM-CF14, -CF22 and -CF24 are not targeted to
lacZHCG1m pre-mRNA since there is no complementarity between the binding domains in
these PTMs and the target gene. Any trans-splicing between these PTMs and lacZHCG1m
pre-mRNA is therefore non-specific (Figure 41A).

[00222] 293T cells were transfected with PTM-CF14, -CF22 or -CF24 and the level of 
non-specific trans-splicing was determined by RT-PCR and by in solution β-gal assays.
Semi-quantitative RT-PCR suggested that PTM-CF24 was significantly less likely than
PTM-CF14 to trans-splice with lacZHCGlm pre-mRNA. Measurement of β-gal activity
confirmed this; cells co-transfected with lacZHCGlm and PTM-CF24 produced 3.7 fold less
β-gal than those co-transfected with lacZHCGlm and PTM-CF14 (Figure 41C). Based on
these data it was estimated that PTM-CF24 is 50 times more likely to trans-splice to its 
target than to a non-specific target. A "safety" version of PTM-CF24, sPTM-CF24, did not 
confer further specificity (Figure 41C). Nonetheless, for PTMs with shorter binding domains 
a "safety" stem involving the binding domain was seen to improve specificity in vivo (Figure 
41C). It was concluded from these data that the longer binding domains resulted in PTMs 
that were not only more efficient but also more specific. 
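
The specification does not state how the specificity estimates were calculated. One plausible way to express such an estimate is as the ratio of background-corrected reporter activity on the specific target to that on the non-specific target. The Python sketch below illustrates this under that assumption. The function names and activity readings are hypothetical and are not data from the experiments described herein.

```python
# Illustrative sketch only. The readings are invented placeholders chosen to
# show the arithmetic, not measured beta-gal activities.


def background_corrected(signal: float, background: float) -> float:
    """Subtract assay background, clamping at zero."""
    return max(signal - background, 0.0)


def specificity_ratio(specific: float, nonspecific: float,
                      background: float = 0.0) -> float:
    """Ratio of on-target to off-target reporter activity after
    background subtraction (assumed definition of specificity)."""
    on_target = background_corrected(specific, background)
    off_target = background_corrected(nonspecific, background)
    if off_target == 0.0:
        # No detectable off-target signal: the ratio is unbounded.
        return float("inf")
    return on_target / off_target


# Hypothetical readings in arbitrary units.
ratio = specificity_ratio(specific=510.0, nonspecific=20.0, background=10.0)
```

With these placeholder readings the ratio is 50, meaning the on-target signal is 50 times the off-target signal once background is removed.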

[00223] The observation that long binding domains increased the specificity of PTMs 
suggested that very long binding domains (>200 nt) could further enhance discrimination. 
Plasmids encoding PTM-CF26 and -CF27, which have binding domains that span 200 nt 
and 411 nt respectively, were constructed and co-transfected with lacZHCGlm plasmid. 
Non-specific trans-splicing of these two PTMs was barely detectable with RT-PCR (Figure 
41B). As measured by the β-gal assay, PTM-CF26 and -CF27 had minimal non-specific
trans-splicing activity (Figure 41C). In a specific trans-splicing reaction with lacZCF9m, as
measured by the solution β-gal assay, PTM-CF26 was as active as PTM-CF14 (Figure 41B).
It was estimated that PTM-CF26 is 80 times more likely to trans-splice to the specific target
(lacZCF9m) than to a non-specific target (lacZHCGlm). Therefore, inclusion of very long
binding domains confers very high specificity on these PTMs.



12. EXAMPLE: CORRECTION OF THE FACTOR VIII GENE
USING 3' EXON REPLACEMENT

[00224] Hemophilia is a bleeding disorder caused by a deficiency in one of the blood
clotting factors. Hemophilia A, which accounts for about 80 percent of all cases, is a
deficiency in clotting factor VIII. The following section describes the successful repair of
the clotting factor VIII gene using spliceosome mediated trans-splicing and demonstrates the
feasibility of repairing factor VIII using gene therapy.

[00225] The coding region for the mouse factor VIII PTM (exons 16-24) was PCR
amplified from a cDNA plasmid template using primers that included unique restriction sites
for directed cloning. All PCR products were generated with cloned Pfu DNA Polymerase
(Stratagene, La Jolla, CA). The coding sequence was cloned into pcDNA3.1(-) using EcoRV
and PmeI restriction sites. The binding domain (BD) was created by PCR using genomic
DNA as a template. Primers included unique restriction sites for directed cloning. The PCR
product was cloned into an existing PTM plasmid (PTM-CF24, pcDNA3.1) using NheI and
SacII restriction sites. This plasmid already contained the remaining elements of the
trans-splicing domain (TSD), including a spacer sequence, polypyrimidine tract (PPT),
branchpoint (BP) and 3' acceptor site. The whole of the TSD was then subcloned into the
vector (described above) containing the factor VIII PTM coding sequences. Finally, bovine
growth hormone 3' untranslated sequences from a separate plasmid clone were subcloned
into the above PTM using PmeI and BamHI restriction sites.

[00226] The whole construct was sequenced and then analyzed by RT-PCR for possible
cryptic splicing, and then subcloned into the AAV plasmid pDLZ20-M2 using XhoI and
BamHI restriction sites (Chao et al., 2000, Gene Therapy 95:1594-1599; Flotte and Carter,
1998, Methods Enzymol., 292:717-32). For some viral (and non-viral) delivery systems, the
size of the therapeutic is a critical consideration. Viral vectors such as adeno-associated
virus (AAV) are preferred because AAV (i) is a non-pathogenic virus with a broad host range,
(ii) induces a low inflammatory response when compared to adenovirus vectors, and (iii) has
the ability to infect both dividing and non-dividing cells. However, the packaging capacity of
the rAAV is limited to approximately 110% of the size of the wild type genome, or ~4.9 kb,
thus leaving little room for large regulatory elements such as promoters and enhancers. The
B-domain deleted human factor VIII is close to the packaging size of AAV; thus,
trans-splicing offers the possibility of delivering a smaller transgene while permitting the
addition of regulatory elements.
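
The packaging constraint can be expressed as a simple size budget. The following Python sketch assumes the ~4.9 kb capacity stated above. All element names and lengths are hypothetical placeholders chosen only to illustrate the arithmetic and are not measurements of the constructs described herein.

```python
# Illustrative sketch only. The capacity is the ~4.9 kb figure given in the
# text; every element length below is a hypothetical placeholder.
from typing import Mapping

RAAV_CAPACITY_BP = 4900


def remaining_capacity(elements: Mapping[str, int],
                       capacity_bp: int = RAAV_CAPACITY_BP) -> int:
    """Base pairs left over after all cassette elements are packaged.
    A negative value means the cassette exceeds the capacity."""
    return capacity_bp - sum(elements.values())


def fits(elements: Mapping[str, int],
         capacity_bp: int = RAAV_CAPACITY_BP) -> bool:
    """True if the combined cassette fits within the packaging capacity."""
    return remaining_capacity(elements, capacity_bp) >= 0


# Hypothetical cassette carrying a near-capacity full-length coding sequence.
full_length_cassette = {"promoter": 600, "coding_sequence": 4400, "polyA": 200}

# Hypothetical PTM cassette carrying only the 3' exons plus a
# trans-splicing domain.
ptm_cassette = {
    "promoter": 600,
    "trans_splicing_domain": 300,
    "coding_sequence": 2000,
    "polyA": 200,
}

full_length_spare = remaining_capacity(full_length_cassette)
ptm_spare = remaining_capacity(ptm_cassette)
```

With these placeholder sizes the full-length cassette exceeds the budget by 300 bp, whereas the PTM cassette leaves 1800 bp available for additional regulatory elements.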

[00227] To eliminate cryptic donor sites in the pre-mRNA upstream of the XhoI PTM
cloning site, approximately 170 bp of sequence was eliminated from the original AAV
construct that includes part of exon 1 and all of the intron 1 sequence (see Fig. 44C).

[00228] The repair model in Fig. 44D shows a simplified model of the mouse factor
VIII pre-mRNA target (endogenous gene) consisting of exons 1-14, intron 14, exon 15,
intron 15, and exons 16-24 containing a neomycin gene insertion. The PTM shown in the
figure consists of exon 16-24 coding sequences and a trans-splicing domain with its own
splicing elements (donor site, branchpoint and pyrimidine tract) and a binding domain.
Details of the binding domain are shown in Fig. 44A and 44B. The binding domain is
complementary to the splice site of intron 15 and part of exon 16 (5' end).

[00229] The key advantages of using 3' exon replacement for gene repair are (i) the
construct requires less sequence and space than a full length gene construct, thereby leaving
more space for regulatory elements, and (ii) SMaRT repair should only occur in those cells
that express the target gene, therefore eliminating any potential problems associated with
ectopic expression of repaired RNA.

[00230] Factor VIII deficient mice were maintained at the animal facilities at the
University of North Carolina at Chapel Hill. For plasmid injections, each mouse was sedated
and placed under a dissecting microscope, and a 1 cm vertical midline abdominal incision was
made. Approximately 100 micrograms of PTM plasmid DNA in phosphate buffered saline
was injected into the liver portal vein. Blood was collected from the retro-orbital plexus at
intervals of 1, 2, 3 and 20 days after injection and assayed for factor VIII activity using the
Coatest assay.

[00231] Factor VIII activity in blood samples collected from mice was assayed using
a standard test called the Coatest assay. The assay was performed according to the
manufacturer's instructions (Chromogenix AB, Milan, Italy). Data indicating repair of factor
VIII in factor VIII knockout mice are shown in Figure 46.

[00232] Hemophilia A defects in humans are broadly split into several categories that
include gross DNA rearrangements, single DNA base substitutions, deletions and insertions.
It has been determined that a rearrangement of DNA involving an inversion and translocation
of exons 1-22 (together with introns) away from exons 23-26 is responsible for ~40% of all
cases of severe hemophilia A. The canine hemophilia A model also has a very similar gross
rearrangement. This mutation will be used as the basis for our human and canine factor VIII
PTM designs.

[00233] Methods for building the human factor VIII PTM will be very similar to those
described above for the mouse PTM, except that different coding regions (exons 23-26) will
be amplified from a human cDNA, the binding domain will be amplified from human
genomic sequence templates (whole genomic DNA or a genomic clone), and a C-terminal
FLAG tag will be engineered in the PTM to be used to detect repaired factor VIII protein.
The remaining elements of the trans-splicing domain, including a spacer sequence,
polypyrimidine tract (PPT), branchpoint (BP) and 3' acceptor site, will be obtained from an
existing plasmid. Where necessary, changes will be made to the binding domain sequence to
eliminate any cryptic splicing within the PTM. The final PTM will be subcloned into the
same mouse AAV plasmid vector, pDLZ20-M2, and a virus preparation made from this
plasmid. The canine factor VIII PTM will be made in an identical fashion but using canine
cDNA and genomic plasmid.

[00234] The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing
description and accompanying Figures. Such modifications are intended to fall within the
scope of the appended claims. Various references are cited herein, the disclosures of which
are incorporated by reference in their entireties.